Abstract: The study of codes for powerline communications has garnered much interest over the past decade. Various types of codes such as permutation codes, frequency permutation arrays, and constant composition codes have been proposed over the years. In this work we study a type of code called bounded symbol weight codes, first introduced by Versfeld et al. in 2005, and a related family of codes that we term constant symbol weight codes. We provide new upper and lower bounds on the size of bounded symbol weight and constant symbol weight codes. We also give direct and recursive constructions of codes for certain parameters.

Index Terms: Asymptotic bounds, constant composition codes, powerline communications, Reed-Solomon codes, symbol weight codes.

I. Introduction 

The notion of transmitting data over powerlines has posed 
an interesting challenge for information and coding theory. 
The noise characteristics of such a communication channel 
include permanent narrowband noise, impulse noise and white 
Gaussian noise. Communication over this channel also has 
an additional requirement that the power envelope be as close to constant as possible. Vinck [31] studied this channel and showed that $M$-ary Frequency Shift Keying ($M$-FSK) modulation, in conjunction with the use of permutation codes, provides a constant power envelope, frequency spreading and redundancy to correct errors resulting from the harsh noise pattern. This has since resulted in research on Frequency Permutation Arrays (FPAs) and constant composition codes (CCCs), which retain the property of a constant power envelope (see [5]-[16], and [14] for a survey). In every codeword of an FPA or a CCC, the frequency of each symbol is fixed by the parameters of the code. Versfeld et al. [29] introduced the notion of the "same-symbol weight" of a code by relaxing the requirement that every symbol must occur a fixed number of times in any codeword. In every codeword of a same-symbol weight code, the frequency of any symbol is bounded. Even with this relaxation it is possible to detect permanent narrowband noise. Versfeld et al. [29], [30] used Reed-Solomon codes to design codes with specified same-symbol weight.

In this work we mostly study the asymptotic behavior of 
codes in the symbol weight space. We use the term bounded 
symbol weight (as opposed to same-symbol weight [29]) to
denote the set of all words in the Hamming space with bounded
symbol weight, that is, any symbol in a word does not occur
more than a fixed number of times, say r. This terminology
is adopted in order to distinguish this space from the constant 
symbol weight space, in which every symbol in a word in the 
Hamming space occurs at most r times and there exists one 
symbol which occurs exactly r times. The constant symbol 
weight space is clearly a subset of the bounded symbol weight 
space. We also use the term symbol weight space to refer to 
either the bounded symbol weight or constant symbol weight 
space. The actual space being referred to is made clear from 
the context and notation. 

As described in [29], [30], the symbol weight determines
whether the code can detect and correct narrowband noise in 
the powerline channel. An FPA or a CCC belongs to some 
constant symbol weight space. The constant symbol weight 
space also contains other compositions all of which have the 
same maximal part, that is, all such codes have the same 
fixed symbol weight. Thus a code in the constant symbol 
weight space is larger than a CCC of a fixed composition, 
and is still relevant for correcting narrowband noise. The 
asymptotic behavior of FPAs has been studied in Q, [8], [14]. In contrast, there are relatively fewer results on the asymptotic behavior of CCCs (see [23], [28]). We consider familiar techniques used to derive classical bounds such as the GV bound, the Johnson bound and the Singleton bound, on codes in the symbol weight space. However, the derivation of these results is not immediate because of the lack of any reasonable structure in the symbol weight spaces. In particular, even the size of a Hamming ball of fixed radius in these spaces depends on the center of the ball. In Section V we also study non-asymptotic bounds on codes in the symbol weight spaces by expressing them in terms of different CCCs. This also raises related combinatorial questions regarding the size and construction of optimal codes, which can be an interesting avenue of future research. In later sections we show that there exist codes with high rate and relative distance which are subsets of Reed-Solomon codes and which lie in the constant symbol weight space.

Throughout this work we are mostly concerned with codes 
that have positive rate and positive relative distance. Hence we 
do not study codes with very large distances, in the Plotkin 
region. We start with some basic definitions and notations in 
the following section. We devote Section III to deriving the exact and asymptotic size of the symbol weight spaces. The results in Section III allow us to determine which constant composition space contained within the symbol weight space contributes the most to the rate of a symbol weight space. These estimates are used in Section IV to determine upper and lower bounds on bounded symbol weight and constant symbol weight codes. In particular, it is clear that asymptotically some constant composition code determines the rate of a symbol weight code. An upper bound is readily obtained from either the Singleton bound or the Linear Programming bound in the Hamming space. In Section IV we also provide a Johnson-type bound on codes in the constant symbol weight space, and use this bound to derive an asymptotic improvement of the Singleton bound and the Linear Programming bound for certain ranges of the minimum distance and the symbol weight. In Section V we provide non-asymptotic lower bounds on symbol weight codes. We introduce a new metric on the space of compositions of an integer and use this metric to lower bound the size of symbol weight codes by a sum of sizes
of CCCs. Finally, in Section VI we provide other constructions of constant symbol weight codes, and in particular show that the asymptotic lower bound presented in Section III is tight for certain parameters, for subcodes of Reed-Solomon codes.

II. Preliminaries

Let $\mathbb{Z}_q = \{0, \dots, q-1\}$ denote an alphabet of $q$ elements. We consider symbol weight codes in the Hamming space $\mathbb{Z}_q^n = \{0, \dots, q-1\}^n$. The symbol weight of a word is defined as the maximum of the frequencies of occurrence of the symbols in the word. For instance, the all-0 word has a symbol weight of $n$. The bounded symbol weight space with symbol weight $r$ is the set of all words with symbol weight at most $r$. This space is denoted by $SW(n, q, \le r)$. The bounded symbol weight space is termed the "same-symbol weight space" in the works of Versfeld et al. [29], [30]. We adopt the term "bounded" to distinguish this space from the constant symbol weight space that we define next. In the constant symbol weight space every word has a symbol weight of exactly $r$. This space is denoted by $SW(n, q, r)$. By the pigeonhole principle, some symbol occurs at least $\lceil n/q \rceil$ times in any word of length $n$, which gives the lower bound $r \ge \lceil n/q \rceil$. Since any word with this lowest value of symbol weight contains the least repetition of any symbol, these words are considered as the ones with the optimal symbol weight.

In this work we study codes in the bounded and constant symbol weight spaces. A bounded (resp. constant) symbol weight code is a subset of the bounded (resp. constant) symbol weight space. Let $A_q^{SW}(n, d, \le r)$ (resp. $A_q^{SW}(n, d, r)$) denote the maximum size of a bounded (resp. constant) symbol weight code with distance $d$ in $SW(n, q, \le r)$ (resp. $SW(n, q, r)$). We denote a composition of $n$ into $q$ non-negative parts by $\mathbf{n} = [n_0, n_1, \dots, n_{q-1}]$. The constant composition space with composition $\mathbf{n} = [n_0, n_1, \dots, n_{q-1}]$ is the subset of $\mathbb{Z}_q^n$ in every word of which the $i$-th symbol occurs exactly $n_i$ times. A constant composition code is a subset of a constant composition space. We use the notation $A_q(\mathbf{n}, d)$ to denote the maximum size of a code with minimum distance at least $d$ in the constant composition space given by the composition $\mathbf{n}$. We use the notation $A_q(n, d)$ to denote the maximum size of a code with minimum distance at least $d$ in the Hamming space. A code $\mathcal{C}$ of length $n$, size $M$ and distance $d$ over $\mathbb{Z}_q$ is denoted by $\mathcal{C}(n, M, d)_q$. If $\mathcal{C}$ has a constant symbol weight $r$ it is denoted by $\mathcal{C}(n, M, d, r)_q$. If $\mathcal{C}$ is a linear code of dimension $k$ over a field $\mathbb{F}_q$ it is denoted by $\mathcal{C}[n, k, d]_q$.

An FPA consists of vectors in which every symbol occurs a fixed number, say $\lambda$, of times. Hence, an FPA is a CCC with composition $\mathbf{n} = [\lambda, \dots, \lambda]$. Thus the FPA is a subset of the constant symbol weight space with symbol weight $\lambda$. Similarly, it can be seen that a CCC with composition $\mathbf{n} = [n_0, \dots, n_{q-1}]$ is a subset of the constant symbol weight space with symbol weight $r = \max\{n_i : i = 0, \dots, q-1\}$.



The coded modulation scheme introduced by Vinck [31] for the powerline channel considered $M$-FSK modulation along with the use of permutation codes. The demodulator considered is a hard-decision demodulator consisting of an envelope detector with a threshold. At every time instance, the demodulator provides a multivalued output consisting of all the symbols that correspond to frequencies at which the output of the envelope detector exceeds the threshold. A narrowband noise in this context results in the same symbol appearing at all time instances. As explained in Versfeld et al. [29], a linear code is less effective in this channel. For instance, the all-zero codeword cannot be distinguished from a narrowband noise.
Hence, permutation codes, FPAs and CCCs are more suitable 
for communication in this channel. To understand why we 
study the constant symbol weight space, consider the following 
example. 

Example 2.1: Consider a CCC in $\mathbb{Z}_4^8$ with composition [1,1,3,3] and minimum distance $d = 4$ that is suitable for correcting narrowband noise in a powerline channel. Let (0,1,2,2,2,3,3,3) be a codeword in this CCC. Then the vector (0,0,0,1,1,1,2,3) is also a suitable vector for correcting narrowband noise. However, this vector belongs to a different constant composition space, namely the one that contains the vectors with composition [3,3,1,1]. Both these constant composition spaces are subsets of the constant symbol weight space with symbol weight 3. In Section V we prove that since the compositions [1,1,3,3] and [3,3,1,1] have distance 4 in a specific metric that we define later, any vector from the constant composition space with composition [1,1,3,3] will be at a distance at least 4 from any vector of the other space with composition [3,3,1,1]. Hence, we can increase the size of the code by including all the codewords from a CCC in the latter space.
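The claims of Example 2.1 can be checked numerically. The following is a minimal sketch; the helper names `composition`, `symbol_weight`, `hamming_distance` and `composition_gap` are ours and do not appear in the paper. The quantity `composition_gap` is only an elementary counting lower bound on the Hamming distance between two words with the given compositions; we do not claim that it coincides with the metric defined in Section V.

```python
from collections import Counter

def composition(word, q):
    """Return [n_0, ..., n_{q-1}]: how often each symbol of Z_q occurs in word."""
    counts = Counter(word)
    return [counts.get(symbol, 0) for symbol in range(q)]

def symbol_weight(word):
    """Largest number of times any single symbol occurs in word."""
    return max(Counter(word).values())

def hamming_distance(u, v):
    """Number of coordinates in which u and v differ."""
    return sum(a != b for a, b in zip(u, v))

def composition_gap(comp_a, comp_b):
    """Counting lower bound on the distance between words of these compositions:
    a symbol occurring a times in one word and b < a times in the other forces
    at least a - b disagreements, and these are disjoint across symbols."""
    return sum(max(a - b, 0) for a, b in zip(comp_a, comp_b))

u = (0, 1, 2, 2, 2, 3, 3, 3)
v = (0, 0, 0, 1, 1, 1, 2, 3)
comp_u = composition(u, 4)
comp_v = composition(v, 4)
print(comp_u, comp_v, symbol_weight(u), symbol_weight(v))
print(hamming_distance(u, v), composition_gap(comp_u, comp_v))
```

Both words have symbol weight 3 although their compositions differ, which is the situation the constant symbol weight space is meant to capture.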

It is clear that any constant symbol weight space with 
symbol weight r can be written as the union of different 
constant composition spaces, each of which contains vectors 
with the same symbol weight r. This relation to the constant 
composition space is used throughout this work. 

In the next section, we first determine the size of the 
constant symbol weight space and the bounded symbol weight 
space. This size is then used to determine a GV-type bound 
on the symbol weight spaces. Unfortunately, the expression 
for the size of the symbol weight spaces is unwieldy and 
gives little insight into the behavior of the lower bounds. 
The symbol weight spaces are also not ball-homogeneous, 
that is, the size of a Hamming ball in the space depends 
on the center of the ball. For example, the bounded symbol weight space in $\mathbb{Z}_3^3$ with symbol weight at most 2 has 24 vectors. The ball of radius 1 around the vector (1,0,0) contains 6 vectors, namely, (1,0,0), (2,0,0), (1,1,0), (1,2,0), (1,0,1), (1,0,2). In contrast, the ball of radius 1 around (2,1,0) contains 7 vectors, namely, (2,0,0), (0,1,0), (1,1,0), (2,1,0), (2,2,0), (2,1,1), (2,1,2). This fact makes it difficult to state good lower bounds. Similar comments apply to the computation of the Hamming bound. Hence, in the following
two sections, we instead study the asymptotic behavior of 
the symbol weight spaces and the rate of the corresponding 
symbol weight codes. 
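The lack of ball-homogeneity in the small example above can be reproduced by exhaustive enumeration. This is a minimal sketch; the function names `bounded_space` and `ball` are illustrative and not taken from the paper.

```python
from collections import Counter
from itertools import product

def symbol_weight(word):
    """Largest number of times any single symbol occurs in word."""
    return max(Counter(word).values())

def hamming_distance(u, v):
    """Number of coordinates in which u and v differ."""
    return sum(a != b for a, b in zip(u, v))

def bounded_space(n, q, r):
    """All words of length n over Z_q with symbol weight at most r."""
    return [w for w in product(range(q), repeat=n) if symbol_weight(w) <= r]

def ball(space, center, radius):
    """Words of the given space within Hamming distance radius of center."""
    return [w for w in space if hamming_distance(w, center) <= radius]

space = bounded_space(3, 3, 2)
print(len(space))                      # size of SW(3, 3, <= 2)
print(len(ball(space, (1, 0, 0), 1)))  # ball around (1,0,0)
print(len(ball(space, (2, 1, 0), 1)))  # ball around (2,1,0)
```

The ball around (1,0,0) is smaller because its neighbor (0,0,0) has symbol weight 3 and therefore falls outside the space.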

In the following section we determine the asymptotic size of 
the symbol weight space. This enables us to determine which 
constant composition space, contained within the symbol 
weight space, has the largest size. 

III. Asymptotic size of the symbol weight space 

To determine the asymptotic size of the symbol weight 
space we first state the expression for the non-asymptotic 
case. Each vector in the symbol weight space corresponds 
to a vector in some constant composition space. Hence we 
introduce some basic definitions below to describe the size 
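of the symbol weight space. Let us denote the set of all compositions of $n$ into $q$ non-negative parts by $\mathcal{N}$, that is

\[ \mathcal{N} = \Big\{ \mathbf{n} \in \mathbb{Z}^q : n_i \ge 0, \ \sum_{i=0}^{q-1} n_i = n \Big\}, \]

and define

\[ \mathcal{N}(r) = \{ \mathbf{n} \in \mathcal{N} : \max\{n_0, \dots, n_{q-1}\} = r \}, \qquad \mathcal{N}(\le r) = \{ \mathbf{n} \in \mathcal{N} : \max\{n_0, \dots, n_{q-1}\} \le r \}. \]

Let $P(N, K, R)$ denote the set of compositions of $N$ into $K$ parts, each part bounded between 0 and $R$. An expression for the size of $P(N, K, R)$ is given by [18, pp. 1037],

\[ |P(N, K, R)| = \sum_{i=0}^{K} (-1)^i \binom{K}{i} \binom{N - i(R+1) + K - 1}{K - 1}, \tag{1} \]

where the terms with $N - i(R+1) < 0$ are taken to be zero.

Define $k_0 = \max\{n - (r-1)q, 1\}$. The quantity $k_0$ corresponds to the smallest number of symbols that can occur with frequency exactly $r$ in any vector with symbol weight $r$. The size of the set $\mathcal{N}(r)$ is given by the following lemma.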



Lemma 3.1:

\[ |\mathcal{N}(r)| = \sum_{k=k_0}^{\lfloor n/r \rfloor} \binom{q}{k} \, |P(n - rk, q - k, r - 1)|, \]

and

\[ k_0 = \max\{n - (r-1)q, 1\} = \begin{cases} q - \Delta, & r = \lceil n/q \rceil = \dfrac{n + \Delta}{q}, \\ 1, & \text{otherwise}, \end{cases} \]

for some $\Delta = \Delta(n, q)$ such that $0 \le \Delta \le q - 1$.

Proof: If a vector $v \in \mathbb{Z}_q^n$ has a composition $\mathbf{n}$ such that exactly $k$ of the symbols in $\mathbb{Z}_q$ have frequency $r$ in $v$, then the rest of the symbols must satisfy the inequality

\[ n - rk \le (q - k)(r - 1). \]

This inequality, in conjunction with the requirement that at least one symbol must have frequency $r$, determines the value of $k_0$. If a composition $\mathbf{n}$ has exactly $k$ symbols with the value $r$, then these $k$ symbols can be chosen in $\binom{q}{k}$ ways. The rest of the elements of $\mathbf{n}$ must correspond to a composition of $n - rk$ into $q - k$ parts, each part bounded between 0 and $r - 1$.

Note that $r$ must satisfy $r \ge \lceil n/q \rceil$. We have

\[ n - (r-1)q \ge 1 \iff r \le \frac{n + q - 1}{q}. \]

There is exactly one integer $r$ which satisfies $\lceil n/q \rceil \le r \le (n + q - 1)/q$. This value of $r$ is given by $r = (n + \Delta)/q$, for some $\Delta$ such that $0 \le \Delta \le q - 1$. ■
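The count in Lemma 3.1 can be compared against direct enumeration for small parameters. The sketch below is illustrative only: the function names are ours, and `count_bounded` uses the standard inclusion-exclusion count of compositions with bounded parts, which we assume is the expression intended in (1).

```python
from itertools import product
from math import comb

def count_bounded(N, K, R):
    """Number of compositions of N into K parts with 0 <= part <= R,
    computed by inclusion-exclusion over the parts that exceed R."""
    if K <= 0:
        return 1 if (K == 0 and N == 0) else 0
    total = 0
    for i in range(K + 1):
        rest = N - i * (R + 1)
        if rest < 0:
            break
        total += (-1) ** i * comb(K, i) * comb(rest + K - 1, K - 1)
    return total

def lemma_count(n, q, r):
    """|N(r)| as in Lemma 3.1: choose the k symbols of frequency exactly r,
    then distribute the remaining n - rk positions with parts at most r - 1."""
    k0 = max(n - (r - 1) * q, 1)
    return sum(comb(q, k) * count_bounded(n - r * k, q - k, r - 1)
               for k in range(k0, n // r + 1))

def brute_count(n, q, r):
    """Enumerate all compositions of n into q parts whose largest part is r."""
    return sum(1 for c in product(range(r + 1), repeat=q)
               if sum(c) == n and max(c) == r)

print(lemma_count(5, 3, 2), brute_count(5, 3, 2))
print(lemma_count(5, 3, 3), brute_count(5, 3, 3))
```

For $n = 5$, $q = 3$, $r = 2$ the only admissible compositions are the three arrangements of [2,2,1], consistent with $k_0 = 2$.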

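The size of the constant symbol weight space $SW(n, q, r)$ can now be determined to be

\[ |SW(n, q, r)| = \sum_{k=k_0}^{\lfloor n/r \rfloor} \binom{q}{k} \binom{n}{r, \dots, r, n - rk} \sum_{x \in P(n - rk, q - k, r - 1)} \binom{n - rk}{x_1, \dots, x_{q-k}}, \tag{2} \]

where $x = (x_1, \dots, x_{q-k})$, and $r$ is repeated $k$ times in the multinomial coefficient $\binom{n}{r, \dots, r, n - rk}$. The size of the bounded symbol weight space is a sum of the sizes of the different constant symbol weight spaces, as shown below:

\[ \begin{aligned} |SW(n, q, \le r)| &= \sum_{s = \lceil n/q \rceil}^{r} \ \sum_{k = k_0(s)}^{\lfloor n/s \rfloor} \binom{q}{k} \binom{n}{s, \dots, s, n - sk} \sum_{x \in P(n - sk, q - k, s - 1)} \binom{n - sk}{x_1, \dots, x_{q-k}} \\ &= \sum_{y \in P(n, q, r)} \binom{n}{y_1, \dots, y_q}, \end{aligned} \tag{3} \]

where $x = (x_1, \dots, x_{q-k})$, $y = (y_1, \dots, y_q)$, and $k_0(s) = \max\{n - (s-1)q, 1\}$.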
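The sizes above rest on the fact that a symbol weight space is a disjoint union of constant composition spaces, each of multinomial size. The following minimal sketch checks this for small parameters; it sums over all compositions with largest part $r$ directly rather than grouping them by $k$ as in (2), and the function names are ours.

```python
from collections import Counter
from itertools import product
from math import factorial

def multinomial(parts):
    """Number of words with the given composition: (sum parts)! / prod(part!)."""
    value = factorial(sum(parts))
    for p in parts:
        value //= factorial(p)
    return value

def sw_size_from_compositions(n, q, r):
    """|SW(n,q,r)| as the sum of the sizes of its constant composition spaces."""
    return sum(multinomial(c) for c in product(range(r + 1), repeat=q)
               if sum(c) == n and max(c) == r)

def sw_size_brute(n, q, r):
    """|SW(n,q,r)| by enumerating every word of length n over Z_q."""
    return sum(1 for w in product(range(q), repeat=n)
               if max(Counter(w).values()) == r)

n, q = 4, 3
sizes = {r: sw_size_from_compositions(n, q, r) for r in range(1, n + 1)}
print(sizes)
```

Summing the sizes over all $r$ recovers $q^n$, since every word has exactly one symbol weight.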

The expressions in the equations above can be used to provide GV-type existence bounds on symbol weight codes. A GV bound on the size of a code $\mathcal{C}$ with minimum distance $d$ in a space $\mathcal{S}$ is given as

\[ |\mathcal{C}| \ge \frac{|\mathcal{S}|}{V(\mathcal{S}, d - 1)}, \]

where $V(\mathcal{S}, d - 1)$ is the volume of a ball of radius $d - 1$ in the space $\mathcal{S}$. Although the sizes of the constant and bounded symbol weight spaces are given by the above equations (2) and (3), respectively, there are several hurdles in applying the GV-type bound directly. First, the space itself lacks any suitable structure and is not even ball-homogeneous. Even for the special case of an FPA in which all the symbols occur equally often in every vector, the expression for the GV bound (and also the Hamming bound) is quite unwieldy because the size of the ball does not have a nice form; see Huczynska [15, Theorem 2.7]. Secondly, the expressions for the sizes of the spaces are not suitable for expressing the bound in a simple form. We instead study the asymptotic form of this bound in the next section. To determine the asymptotic results, we first need to understand the behavior of the sizes of the symbol weight spaces for large block length $n$.

The expression for the asymptotic size of the constant symbol weight space is given by the following theorem. A similar expression for the bounded symbol weight space can be readily derived from this theorem, and is mentioned at the end of this section. The following theorem holds for any $q$ such that $q$ grows at most proportionally to $n$. Note that all the asymptotics are with respect to $n$ and so the term $o(1)$ below goes to zero as $n$ goes to infinity.

Theorem 3.2: For any $q$ such that $q = \theta n^{\epsilon}$, where $\theta$ is a positive constant and $0 \le \epsilon \le 1$,

\[ \frac{1}{n} \log_q |SW(n, q, r)| = \begin{cases} h_q\big(1 - \frac{r}{n}\big) - o(1), & r > \lceil n/q \rceil, \\ 1 - o(1), & r = \lceil n/q \rceil. \end{cases} \]

We first give a brief outline of the proof of this theorem. As mentioned earlier, a constant symbol weight space with symbol weight $r$ is a union of several constant composition spaces, each of which contains vectors of symbol weight $r$. We first show in Lemma 3.3 that the number of constant composition spaces does not contribute to the rate of the constant symbol weight space. This is not surprising and it is true even for the Hamming space, when considered as a union of constant composition spaces. Because of this lemma, we know that there is a constant composition space which dominates the expression for the rate. Lemmas 3.4 and 3.5 below help us determine this dominant term. It turns out that this dominant term comes from the constant composition space which has exactly $k_0$ symbols that occur exactly $r$ times in any vector.

We continue with the proof of the theorem, by first establishing a sequence of lemmas presented below. Let $h_p(x)$ be the $p$-ary entropy function defined in the range $0 \le x \le 1$ as

\[ h_p(x) = -x \log_p \frac{x}{p - 1} - (1 - x) \log_p (1 - x). \]



Lemma 3.3:

\[ \frac{1}{n} \log_q |\mathcal{N}(r)| = o(1). \]

Proof: The number of terms in the summation over the range $k_0 \le k \le \lfloor n/r \rfloor$ is at most $n$. Hence only one of the terms in the summation dominates in the asymptotics. We note that $|P(n - rk, q - k, r - 1)| \le |\mathcal{N}|$. Also, $|\mathcal{N}| = \binom{n + q - 1}{q - 1}$ (see [27, pp. 415]). For a constant $q$, it shows that $|P(n - rk, q - k, r - 1)|$ grows at most polynomially in $n$ and hence

\[ \frac{1}{n} \log_q |\mathcal{N}(r)| = \frac{1}{n} \log_q |P(n - rk, q - k, r - 1)| + o(1) \le \frac{1}{n} \log_q (a n^{q-1}) + o(1) = o(1), \]

for some positive constant $a$. For $q = \theta n^{\epsilon}$, $0 < \epsilon \le 1$, and positive constant $\theta$, we get

\[ \frac{1}{n} \log_q \binom{n + q - 1}{q - 1} \le \frac{n + q - 1}{n} \, h_2\Big( \frac{q - 1}{n + q - 1} \Big) \log_q 2 + o(1) = o(1), \]

and hence

\[ \frac{1}{n} \log_q |P(n - rk, q - k, r - 1)| \le \frac{n + q - 1}{n} \, h_2\Big( \frac{q - 1}{n + q - 1} \Big) \log_q 2 + o(1) = o(1). \quad ■ \]

By the above lemma, we can conclude that in the asymptotics of $|SW(n, q, r)|$ only one term $\binom{n - rk}{x_1, \dots, x_{q-k}}$ from the inner summation dominates, and similarly only one term from $\binom{q}{k} \binom{n}{r, \dots, r, n - rk}$ is present in the asymptotics. The dominant multinomial terms are given by an optimal choice of $k$. First, we determine the dominating multinomial term from the inner summation in (2). We use the following lemma. Let $\Gamma(x)$ denote the Gamma function,

\[ \Gamma(x) = \int_0^{\infty} t^{x-1} e^{-t} \, dt. \]

In particular, for a positive integer $x$, $\Gamma(x) = (x - 1)!$.

Lemma 3.4: [25, pp. 109] Let $x_1, \dots, x_K$ be non-negative real numbers. Then

\[ \prod_{i=1}^{K} \Gamma(x_i) \ge \Big[ \Gamma\Big( \frac{1}{K} \sum_{i=1}^{K} x_i \Big) \Big]^{K}. \]

This lemma immediately implies that $\binom{n}{x_1, \dots, x_k} \le \binom{n}{n/k, \dots, n/k}$. Hence the dominating term in the inner summation in (2) is given by $\binom{n - rk}{l, \dots, l}$, where $l = (n - rk)/(q - k)$.$^1$ For large $n$ we obtain the following asymptotic expression for $|SW(n, q, r)|$:

\[ \frac{1}{n} \log_q |SW(n, q, r)| = \log_q n - k \frac{r}{n} \log_q r - \frac{n - rk}{n} \log_q (n - rk) + \frac{n - rk}{n} \log_q (q - k) + \frac{1}{n} \log_q \binom{q}{k} + o(1). \tag{4} \]

Neglecting the $o(1)$ term, the maximum of the expression in equation (4) over $k$ yields the rate of the constant symbol weight space. Unfortunately, a closed-form expression for the optimizing value of $k$ seems difficult to achieve, even if $k$ is considered over the reals instead of the integers. We instead look at how the expression behaves for large $n$. The lemma below asserts that the maximum is achieved at $k^* = k_0$ as $n \to \infty$.

$^1$ We ignore the fact that the ratios may not be integers. This argument can be made more rigorous, but cumbersome, by taking the composition to be $l_1 = \lfloor \frac{n - rk}{q - k} \rfloor$ for $(q - k)(1 - \{\frac{n - rk}{q - k}\})$ times and $l_2 = \lceil \frac{n - rk}{q - k} \rceil$ for $(q - k)\{\frac{n - rk}{q - k}\}$ times, where $\{x\}$ denotes the fractional part of a real number $x$.

Lemma 3.5: Let $\{f_n(x)\}_{n=1}^{\infty}$ be a family of bounded, strictly monotonic decreasing functions in $x$, defined over the domain $x \in [x_0, X_0]$, such that $\lim_{n \to \infty} f_n(x) = f(x)$. Let $\{g_n(x)\}_{n=1}^{\infty}$ be a family of non-negative functions such that $0 \le g_n(x) \le C_n$, where $C_n$ depends only on $n$ and $\lim_{n \to \infty} C_n = 0$. Then,

\[ \max_{x \in [x_0, X_0]} \lim_{n \to \infty} \big( f_n(x) + g_n(x) \big) = f(x_0) = \lim_{n \to \infty} \max_{x \in [x_0, X_0]} \big( f_n(x) + g_n(x) \big). \]



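Proof: The strict monotonicity $f_n(x) > f_n(y)$ for any $x, y$ with $x_0 \le x < y \le X_0$ implies that $f(x_0) \ge f(x)$ for all $x \in [x_0, X_0]$. Now,

\[ \begin{aligned} & \max_{x \in [x_0, X_0]} f_n(x) + C_n \ \ge \max_{x \in [x_0, X_0]} \big( f_n(x) + g_n(x) \big) \ \ge \max_{x \in [x_0, X_0]} f_n(x) \\ \Rightarrow\ & f_n(x_0) + C_n \ \ge \max_{x \in [x_0, X_0]} \big( f_n(x) + g_n(x) \big) \ \ge f_n(x_0) \\ \Rightarrow\ & \lim_{n \to \infty} \max_{x \in [x_0, X_0]} \big( f_n(x) + g_n(x) \big) = f(x_0). \end{aligned} \]

We also get

\[ \max_{x \in [x_0, X_0]} \lim_{n \to \infty} \big( f_n(x) + g_n(x) \big) = \max_{x \in [x_0, X_0]} f(x) = f(x_0). \quad ■ \]

This lemma implies that we can determine the asymptotic optimum of $f_n(x) + g_n(x)$ by simply taking the limit of the sequence of numbers $f_n(x_0) + g_n(x_0)$, which converges to $f(x_0)$.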



Proof of Theorem 3.2: We apply Lemma 3.5 as follows. Let

\[ F_n(k) = \log_q n - k \frac{r}{n} \log_q r - \frac{n - rk}{n} \log_q (n - rk) + \frac{n - rk}{n} \log_q (q - k), \qquad G_n(k) = \frac{1}{n} \log_q \binom{q}{k}, \]

be defined over integer $k \in [k_0, \lfloor n/r \rfloor]$. $G_n(k)$ can be upper bounded by a term independent of $k$,

\[ G_n(k) \le \frac{1}{n} \log_q \binom{q}{\lfloor q/2 \rfloor}, \]

and $\lim_{n \to \infty} \frac{1}{n} \log_q \binom{q}{\lfloor q/2 \rfloor} = 0$. We now show that for $n \ne rq$, $F_n(k)$ is strictly monotonically decreasing. To establish this we relax $k$ to the reals and consider the derivative $F_n'(k)$. Writing the logarithms in terms of the natural logarithm, we get

\[ \begin{aligned} n \ln q \cdot F_n'(k) &= -r \ln r + r \ln \frac{n - rk}{q - k} + r - \frac{n - rk}{q - k} \\ &= r \Big[ \ln \frac{n - rk}{r(q - k)} - \Big( \frac{n - rk}{r(q - k)} - 1 \Big) \Big] \\ &< 0, \end{aligned} \]

where the last line follows because of the fact that $n - rk < r(q - k)$, and that $\ln x \le x - 1$ for $0 < x \le 1$, with equality only at $x = 1$. We also note that, since $n \le rq$ always holds, $n - rk < r(q - k)$ if and only if $n \ne rq$. Hence $F_n(k)$ is strictly monotonic decreasing for $n \ne rq$. For $n = rq$, $F_n(k)$ is a constant independent of $k$ and $k_0 = q$, and hence the optimal value of $F_n(k)$ is at $k = q$. Since Lemma 3.5 is applicable to $F_n(k) + G_n(k)$, we concentrate only on determining the asymptotics of $F_n(k_0)$. For $r > \lceil n/q \rceil$ we get $k_0 = 1$ and

\[ F_n(1) = -\frac{r}{n} \log_q \frac{r}{n} - \Big( 1 - \frac{r}{n} \Big) \log_q \Big( 1 - \frac{r}{n} \Big) + \Big( 1 - \frac{r}{n} \Big) \log_q (q - 1) - o(1). \]

For $r = \lceil n/q \rceil = (n + \Delta)/q$, we have $k_0 = q - \Delta = n - (r - 1)q$, so that $n - r k_0 = \Delta(r - 1)$ and $q - k_0 = \Delta$, and

\[ F_n(k_0) = \log_q n - \frac{(q - \Delta) r}{n} \log_q r - \frac{\Delta (r - 1)}{n} \log_q (r - 1) = 1 - o(1). \]

This proves Theorem 3.2. ■

The exponent of the asymptotic size of the bounded symbol weight space $SW(n, q, \le r)$ is always $n(1 - o(1))$ since it contains $SW(n, q, \lceil n/q \rceil)$.

IV. Asymptotic size of symbol weight codes

In this section we provide estimates on the rate of symbol weight codes for all $q = \theta n^{\epsilon}$, for any positive constant $\theta$, and for $0 \le \epsilon \le 1$. We considered the asymptotic behavior of the symbol weight spaces because of the difficulty in determining reasonable expressions for fixed $n$. Below, we determine upper and lower bounds on the rate of a symbol weight code. First we determine a GV-type bound in Theorem 4.3 below. The Singleton and Linear Programming (LP) upper bounds on codes in the Hamming space are applicable to the symbol weight codes too. In Theorem 4.4 we show that for constant symbol weight codes, the Singleton and LP upper bounds can be improved substantially for a specific range of the symbol weight.

The following lemma is immediate and it shows that the rate of symbol weight codes can be given in terms of the rate of a CCC.

Lemma 4.1:

\[ \begin{aligned} \frac{1}{n} \log_q A_q^{SW}(n, d, r) &= \frac{1}{n} \max_{\mathbf{n} \in \mathcal{N}(r)} \log_q A_q(\mathbf{n}, d) + o(1), \\ \frac{1}{n} \log_q A_q^{SW}(n, d, \le r) &= \frac{1}{n} \max_{\mathbf{n} \in \mathcal{N}(\le r)} \log_q A_q(\mathbf{n}, d) + o(1). \end{aligned} \tag{5} \]

Proof: Note that we clearly have the following upper and lower bounds on $A_q^{SW}(n, d, r)$:

\[ \max_{\mathbf{n} \in \mathcal{N}(r)} A_q(\mathbf{n}, d) \le A_q^{SW}(n, d, r) \le |\mathcal{N}(r)| \max_{\mathbf{n} \in \mathcal{N}(r)} A_q(\mathbf{n}, d). \tag{6} \]

The lemma now follows from an application of Lemma 3.3. The second expression in (5) can be determined similarly. ■

We state the LP upper bound on codes in the Hamming space from Aaltonen [2].

Theorem 4.2: [2]

\[ \limsup_{n \to \infty} \frac{1}{n} \log_q A_q(n, d) \le h_q(k_q(\delta)), \qquad 0 < \delta < \frac{q - 1}{q}, \]

where $k_q(x) = \frac{q - 1}{q} - \frac{q - 2}{q} x - \frac{2}{q} \sqrt{(q - 1) x (1 - x)}$, $0 \le x \le 1$.

An upper bound on symbol weight codes is readily obtained from an upper bound on codes in the Hamming space, since

\[ A_q^{SW}(n, d, r) \le A_q^{SW}(n, d, \le r) \le A_q(n, d). \]

Thus for constant $q$ the LP bound is also an upper bound on symbol weight codes. For $q$ growing with $n$, the Singleton bound is an upper bound on symbol weight codes. Below, we provide asymptotic estimates of symbol weight codes.

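Theorem 4.3: Let $q = \theta n^{\epsilon}$, where $\theta > 0$ is a constant, and $0 \le \epsilon \le 1$. Let $d/n \to \delta$ and $r/n \to \rho$ as $n \to \infty$, where $0 < \delta < \frac{q - 1}{q}$. Then for $q$ constant, i.e., $\epsilon = 0$,

\[ \begin{aligned} \liminf_{n \to \infty} \frac{1}{n} \log_q A_q^{SW}(n, d, r) &\ge h_q(1 - \rho) - h_q(\delta), \\ \liminf_{n \to \infty} \frac{1}{n} \log_q A_q^{SW}(n, d, \le r) &\ge 1 - h_q(\delta). \end{aligned} \tag{7} \]

For $q$ increasing with $n$ one can use the Singleton bound. Thus for $0 < \epsilon \le 1$, we get

\[ \begin{aligned} \liminf_{n \to \infty} \frac{1}{n} \log_q A_q^{SW}(n, d, r) &\ge 1 - \rho - \delta, && r = \rho n, \\ \lim_{n \to \infty} \frac{1}{n} \log_q A_q^{SW}(n, d, r) &= 1 - \delta, && r = o(n), \\ \lim_{n \to \infty} \frac{1}{n} \log_q A_q^{SW}(n, d, \le r) &= 1 - \delta, && \text{any } r. \end{aligned} \tag{8} \]

Remark: Note that for $q$ increasing with $n$ the following limits can be inferred,

\[ \begin{aligned} \lim_{n \to \infty} \frac{1}{n} \log_q A_q^{SW}(n, d, r) &= \lim_{n \to \infty} \frac{1}{n} \log_q A_q(n, d), && r = o(n), \\ \lim_{n \to \infty} \frac{1}{n} \log_q A_q^{SW}(n, d, \le r) &= \lim_{n \to \infty} \frac{1}{n} \log_q A_q(n, d), && \text{any } r. \end{aligned} \]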

Proof: We use the following lower bound on the constant symbol weight space, which is actually an Elias-type bound on the Hamming space (see Levenshtein [19]). This is followed by using the GV bound in the Hamming space.

\[ \begin{aligned} & A_q(n, d) \le \frac{q^n}{|SW(n, q, r)|} A_q^{SW}(n, d, r) \\ \Rightarrow\ & \frac{1}{n} \log_q A_q^{SW}(n, d, r) \ge \frac{1}{n} \log_q A_q(n, d) + h_q\Big( 1 - \frac{r}{n} \Big) - 1 - o(1) \\ & \qquad \ge h_q\Big( 1 - \frac{r}{n} \Big) - \frac{1}{n} \log_q V(\mathbb{Z}_q^n, d - 1) - o(1) \\ & \qquad \ge h_q\Big( 1 - \frac{r}{n} \Big) - h_q\Big( \frac{d - 1}{n} \Big) - o(1), \end{aligned} \]

where $h_q(x)$ is the $q$-ary entropy function and $V(\mathbb{Z}_q^n, d - 1)$ is the volume of the ball of radius $d - 1$ in the Hamming space. Similarly, for the bounded symbol weight space, we obtain

\[ \frac{1}{n} \log_q A_q^{SW}(n, d, \le r) \ge 1 - h_q\Big( \frac{d - 1}{n} \Big) - o(1). \]

For a constant $q$, the asymptotics of these expressions are as given in (7).

For $q$ growing with $n$, the upper bound on the symbol weight codes is provided by the Singleton bound,

\[ A_q^{SW}(n, d, r) \le A_q^{SW}(n, d, \le r) \le A_q(n, d) \le q^{n - d + 1}. \]

Using the fact that $h_q(x) = x$ in the limit as $q \to \infty$, we get the results as stated in the theorem. In particular for $r = o(n)$, $\lim_n h_q(1 - r/n) = 1$ and $\lim_n h_q((d - 1)/n) = \delta$. Since $A_q^{SW}(n, d, \le r)$ is at least as large as $A_q^{SW}(n, d, \lceil n/q \rceil)$, it gives the result stated in (8). ■
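Two facts about the $q$-ary entropy function used in this proof, namely that $h_q$ attains the value 1 at $x = (q-1)/q$ and that $h_q(x)$ approaches $x$ as $q$ grows, can be checked numerically. The sketch below is illustrative; the function names `hq` and `gv_rate_constant_sw` are ours, and the latter simply evaluates the right-hand side of the first bound in (7).

```python
from math import log

def hq(x, q):
    """q-ary entropy h_q(x) for 0 <= x <= 1, with 0 * log(0) taken as 0."""
    if x == 0:
        return 0.0
    if x == 1:
        return log(q - 1) / log(q)
    return (x * log(q - 1) - x * log(x) - (1 - x) * log(1 - x)) / log(q)

def gv_rate_constant_sw(rho, delta, q):
    """Right-hand side of the first bound in (7): h_q(1 - rho) - h_q(delta)."""
    return hq(1 - rho, q) - hq(delta, q)

print(hq(15 / 16, 16))            # entropy at x = (q-1)/q
print(hq(0.3, 2 ** 40))           # close to 0.3 for a large alphabet
print(gv_rate_constant_sw(0.2, 0.3, 16))
```

For a large alphabet the lower bound $h_q(1 - \rho) - h_q(\delta)$ is therefore close to $1 - \rho - \delta$, in line with the first bound in (8).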

The lower bound (7) in the theorem may be interpreted as a GV-type bound in the symbol weight space that can be obtained if the volume of a ball of radius $d - 1$ in the symbol weight space is upper bounded by the volume of a ball of radius $d - 1$ in the Hamming space. Since the symbol weight space is not ball-homogeneous, that is, the size of the balls of radius $d - 1$ depends on the center, we adopt the above method to derive the GV-type lower bound.$^2$

In the following theorem we provide an improvement on the 
upper bound for a constant symbol weight code with symbol 
weight r. 

Theorem 4.4: Let $\lceil n/q \rceil \le r \le \lfloor 2n/3 \rfloor$ and $q = \theta n^{\epsilon}$ with $0 \le \epsilon \le 1$. Let $d$ satisfy $r \le d$. For $n \to \infty$, let $r/n \to \rho$ and $d/n \to \delta$. Then, for constant $q$,

\[ \limsup_{n \to \infty} \frac{1}{n} \log_q A_q^{SW}(n, d, r) \le h_q\Big( 1 - \frac{3}{2}\rho \Big) - (1 - \rho) \, h_q\Big( \frac{1 - \frac{3}{2}\rho}{1 - \rho} \Big) + 1 - \frac{3}{2}\rho. \]

For $q$ growing with $n$, we get

\[ \limsup_{n \to \infty} \frac{1}{n} \log_q A_q^{SW}(n, d, r) \le 1 - \frac{3}{2}\rho. \]

The proof of this theorem relies on a Johnson-type upper bound, and a lemma given below. We follow some elements of the derivation of the Singleton bound in Omrani and Kumar [26]. However, our purpose is to improve the Singleton bound by using the parameters of the constant symbol weight space. The improvement mainly stems from the following lemma.

Lemma 4.5: Let $d \ge r > 2n/3$. Then $A_q^{SW}(n, d, r) = q$.

Proof: We claim that if there are two codewords $x, y$, both with symbol weight $r$, then the symbol which repeats $r$ times must be different in the two codewords. Suppose not. Then the two codewords $x, y$ must have at least $n - 2(n - r)$ coordinates which contain the same symbol. Thus the distance between the codewords is at most $2(n - r)$, so that $d \le 2(n - r) < 2n/3$, since $r > 2n/3$. This contradicts $d \ge r > 2n/3$. We get $A_q^{SW}(n, d, r) \le q$. To show the opposite inequality, let $x$ be a word with symbol weight $r$, $r > 2n/3$. Then the words $x + a\mathbf{1}$, $a \in \mathbb{Z}_q$, where $\mathbf{1}$ is the all-one word, also have symbol weight $r$ and are at distance $n$ from one another. This establishes that $A_q^{SW}(n, d, r) \ge q$. ■
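Lemma 4.5 can be confirmed by exhaustive search for very small parameters. The sketch below is illustrative: it uses a plain backtracking search for the largest code (practical only for tiny spaces), the parameter choice $n = 4$, $q = 3$, $r = d = 3$ is ours, and the function names are hypothetical.

```python
from collections import Counter
from itertools import product

def symbol_weight(word):
    """Largest number of times any single symbol occurs in word."""
    return max(Counter(word).values())

def hamming_distance(u, v):
    """Number of coordinates in which u and v differ."""
    return sum(a != b for a, b in zip(u, v))

def constant_space(n, q, r):
    """All words of length n over Z_q with symbol weight exactly r."""
    return [w for w in product(range(q), repeat=n) if symbol_weight(w) == r]

def max_code_size(words, d):
    """Size of a largest subset of words with pairwise distance >= d (backtracking)."""
    best = 0

    def extend(size, candidates):
        nonlocal best
        best = max(best, size)
        for i, w in enumerate(candidates):
            # Prune: even taking every remaining candidate cannot beat best.
            if size + len(candidates) - i <= best:
                return
            compatible = [u for u in candidates[i + 1:] if hamming_distance(u, w) >= d]
            extend(size + 1, compatible)

    extend(0, list(words))
    return best

def translates(word, q):
    """The q words word + a*1 (mod q) used in the proof of the lemma."""
    return [tuple((s + a) % q for s in word) for a in range(q)]

n, q, r, d = 4, 3, 3, 3   # satisfies d >= r > 2n/3
size = max_code_size(constant_space(n, q, r), d)
code = translates((0, 0, 0, 1), q)
print(size, code)
```

The translates of (0,0,0,1) form a code of size $q$ that meets the bound.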

We next give the Johnson-type bound. 

$^2$ For certain parameters, better lower bounds on $A_q(n, d)$, for instance from algebraic geometry codes, can improve on this GV bound on $A_q^{SW}(n, d, r)$.

Lemma 4.6:

\[ A_q^{SW}(n, d, r) \le \frac{nq}{n - r} \, A_q^{SW}(n - 1, d, r). \]

Proof: Consider the code-matrix of the constant symbol weight code with parameters $(n, M, d, r)_q$. Any row of the code-matrix has at least one symbol of frequency $r$. Fix one symbol of frequency $r$ in each row. There are a total of $M(n - r)$ symbols in the code-matrix which do not contribute to the symbol weight in any codeword. The average number of symbols, averaged over the $n$ columns, with frequency at most $r$ is then $M(n - r)/n$. The average number per symbol, averaged over $n$ columns and $q$ symbols, is $M(n - r)/(qn)$. Thus, there exists at least one symbol $a$ and at least one column $m$ such that the subcode consisting of the codewords with the symbol $a$ in column $m$ has size at least $M(n - r)/(nq)$. Discarding the coordinate corresponding to $m$ gives us the bound as stated in the lemma. ■



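Proof of Theorem 4.4: We now proceed to prove the theorem. Apply Lemma 4.6 recursively $l + 1$ times to get

\[ A_q^{SW}(n, d, r) \le \frac{nq}{n - r} \cdot \frac{(n - 1)q}{n - 1 - r} \cdots \frac{(n - l)q}{n - l - r} \, A_q^{SW}(n - l - 1, d, r). \]

The recursion stops for $l$ such that $r = \lceil 2(n - l - 1)/3 \rceil$ and for $d \ge r$. For this value of $r$ and $d$, $A_q^{SW}(n - l - 1, d, r) = q$, by Lemma 4.5. The condition $l \ge 0$ implies $r \le 2n/3$. Constraints on $l$ are obtained from the inequalities

\[ 2(n - l - 1)/3 \le r = \lceil 2(n - l - 1)/3 \rceil \le 2(n - l)/3. \]

This gives us the upper bound

\[ A_q^{SW}(n, d, r) \le q^{l + 2} \prod_{i=0}^{l} \frac{n - i}{n - r - i} = q^{l + 2} \, \frac{\binom{n}{l + 1}}{\binom{n - r}{l + 1}}, \qquad n - \frac{3r}{2} - 1 \le l \le n - \frac{3r}{2}. \]

In the asymptotics as $n \to \infty$ we get $l/n \to 1 - \frac{3}{2}\rho$. This gives us the upper bounds as stated in the theorem. ■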
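The telescoping product in this proof can be checked with exact rational arithmetic. The sketch below is illustrative: the function names are ours, `stopping_index` returns the first $l$ satisfying the stopping condition $r = \lceil 2(n-l-1)/3 \rceil$, and the closed form with binomial coefficients is our own simplification of the product.

```python
from fractions import Fraction
from math import comb

def recursion_product(n, r, q, l):
    """q * prod_{i=0}^{l} (n-i)q / (n-i-r): Lemma 4.6 applied l+1 times,
    followed by the value q from Lemma 4.5 for the shortened code."""
    value = Fraction(q)
    for i in range(l + 1):
        value *= Fraction((n - i) * q, n - i - r)
    return value

def closed_form(n, r, q, l):
    """The same quantity written as q^(l+2) * C(n, l+1) / C(n-r, l+1)."""
    return Fraction(q ** (l + 2) * comb(n, l + 1), comb(n - r, l + 1))

def stopping_index(n, r):
    """First l >= 0 with r == ceil(2(n-l-1)/3), or None if there is none."""
    for l in range(n):
        if r == -((-2 * (n - l - 1)) // 3):   # integer ceiling of 2(n-l-1)/3
            return l
    return None

n, r, q = 12, 6, 4
l = stopping_index(n, r)
print(l, recursion_product(n, r, q, l), closed_form(n, r, q, l))
```

For these sample values the recursion stops at $l = 2$, which lies in the range $n - 3r/2 - 1 \le l \le n - 3r/2$.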

In the case of $q$ growing with $n$, this theorem improves on the Singleton bound $1 - \delta$ for $\delta < 3\rho/2$. The upper bound in Theorem 4.4 for constant $q$ improves on the LP bound for a certain range of parameters. The improvements are possible only for large $\rho$ and for $q > 5$. For $q = 2, 3$ the restrictions $\delta \ge \rho$ and $\rho \le 2/3$ do not leave room for improvement. For constant $q$ the upper bound is in fact concave. This can be verified by taking the second derivative with respect to $\rho$, which results in the negative expression $-1/(\rho(1 - \rho) \ln q)$. Since this bound does not depend on $\delta$, it seems that further improvements might be possible.
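The concavity claim can be checked numerically by comparing a finite-difference second derivative with $-1/(\rho(1-\rho)\ln q)$. This is a minimal sketch under the assumption that the constant-$q$ bound of Theorem 4.4 reads $h_q(1 - \frac{3}{2}\rho) - (1-\rho)h_q\big(\frac{1 - \frac{3}{2}\rho}{1-\rho}\big) + 1 - \frac{3}{2}\rho$, which is the form that follows from the binomial ratio in the proof; the function names are ours.

```python
from math import log

def hq(x, q):
    """q-ary entropy h_q(x) for 0 <= x <= 1, with 0 * log(0) taken as 0."""
    if x == 0:
        return 0.0
    if x == 1:
        return log(q - 1) / log(q)
    return (x * log(q - 1) - x * log(x) - (1 - x) * log(1 - x)) / log(q)

def theorem_4_4_bound(rho, q):
    """Assumed constant-q upper bound on the rate, valid for 0 < rho < 2/3."""
    lam = 1 - 1.5 * rho
    return hq(lam, q) - (1 - rho) * hq(lam / (1 - rho), q) + lam

def second_derivative(f, x, step=1e-4):
    """Central finite-difference estimate of f''(x)."""
    return (f(x + step) - 2 * f(x) + f(x - step)) / step ** 2

q = 16
for rho in (0.3, 0.5, 0.6):
    numeric = second_derivative(lambda t: theorem_4_4_bound(t, q), rho)
    claimed = -1 / (rho * (1 - rho) * log(q))
    print(rho, numeric, claimed)
```

The two columns agree to within the accuracy of the finite-difference estimate, and both are negative throughout the range.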

Example plots of all the bounds are provided in Figures 1 and 2. Since the improvements are for larger $q$, we show the bounds for $q = 16$. In Figure 1 the first plot is obtained at a particular value of $\delta$ and the second plot is obtained at a particular value of $\rho$. The improvements (over LP) are obtained in the regions $0.536 < \rho < 0.67$ and $0.60 < \delta < 0.774$, respectively. Figure 2 shows the plots when $q$ is increasing with $n$. In this case, we compare against the Singleton upper bound. It shows improvements in the region $\rho \le \delta < \frac{3}{2}\rho$. Construction of codes which meet this upper bound for any parameters is an open problem.

Remarks: 



1) For $q \ge n$ and $r = 1$, Dukes [12] provides a Singleton bound, $A_q^{SW}(n,d,1) \le q(q-1)\cdots(q-n+d)$. Not surprisingly, for $q = \theta n$, $\theta \ge 1$, this reduces to $1-\delta$ in the asymptotics.

2) Missing from the list of bounds above is a Hamming- 
type bound on the symbol weight codes. The lack of a 
simple expression for the size of the ball is the main 
reason behind this omission. 
Fig. 1. Plots for $\delta = 2/3$ and for $\rho = 0.6$, respectively, under $q = 16$.

V. Lower bound on symbol weight codes 

As mentioned in the previous sections, the traditional means of determining the GV-type bounds is not very useful for non-asymptotic block lengths $n$. In this section we adopt a different approach to determine lower bounds on the size of symbol weight codes. The lower bounds are obtained by using corresponding constructions and lower bounds on constant composition codes. There is a wide body of literature on CCCs that can be used to determine these lower bounds; see [5]-[16]. To the best of our knowledge, most of the work in the literature on constant composition codes has focused on determining constructions and bounds for either very large distances (in the region where the Johnson bound or Plotkin bound is applicable) or for very small distances such as $d = 2, 3, 4$. For FPA and permutation codes, there do exist constructions with distances in the ranges in between; see [4], [8], [15]. We note three papers in this connection. Sidorenko [28] provides an asymptotic upper bound on CCCs; this is not useful for this section since we look at lower bounds. In a very recent work, Luo and Helleseth [23] construct CCCs of almost uniform composition and with relative distances very close to the Plotkin limit ($d/n \approx (q-1)/q$). The bound that we provide below requires a relatively large alphabet size so that codes from different constant composition spaces can be combined. Hence, the CCCs from [23] are not useful in this context. Chu et al. [8] provide some lower bounds for CCCs for large distances. We use some of the constructions from this latter work in this section to provide examples of lower bounds on symbol weight codes.

Fig. 2. Plots for $\delta = 2/3$ and for $\rho = 0.6$, respectively.

We first describe a method to determine the size of a symbol weight code in terms of CCCs. This method may be viewed as a generalization of an elementary bound in Bachoc et al. [3, Equation (2)] on binary bounded weight codes to $q$-ary spaces.

Let $\mathbf{n} = [n_0, \ldots, n_{q-1}]$ and $\mathbf{n}' = [n'_0, \ldots, n'_{q-1}]$ denote two different compositions of $n$. The aim here is to lower bound the size of a symbol weight code by the sum of all possible different CCCs which have the same symbol weight. Thus, we first need to determine the condition on two different compositions $\mathbf{n}$ and $\mathbf{n}'$ such that any vector $c$ with composition $\mathbf{n}$ is at least distance $d$ away from a vector $c'$ with composition $\mathbf{n}'$. It can be seen that $\min\{n_i, n'_i\}$ is the maximum number of coordinates in $c$ and $c'$ where the $i$-th symbol is common to both. Thus, the Hamming distance $d_H(c,c')$ satisfies

$$d_H(c,c') \ge n - \sum_{i=0}^{q-1} \min\{n_i, n'_i\}.$$

A sufficient condition for $d_H(c,c') \ge d$ to hold is

$$n - \sum_{i=0}^{q-1} \min\{n_i, n'_i\} \ge d. \qquad (9)$$

Let

$$d_+(\mathbf{n},\mathbf{n}') = n - \sum_{i=0}^{q-1} \min\{n_i, n'_i\}. \qquad (10)$$

Then we obtain

Lemma 5.1: $d_+(\cdot,\cdot)$ is a distance function on $\mathcal{N}$.

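Proof: $d_+(\mathbf{n},\mathbf{n}')$ is clearly symmetric. To show the triangle inequality, we note that we can rewrite

$$d_+(\mathbf{n},\mathbf{n}') = \sum_i \left(n_i - \min\{n_i, n'_i\}\right) = \sum_i (n_i - n'_i)^+,$$

where $(x)^+ = \max\{x, 0\}$. Also, for any non-negative real numbers $x, y, z$, it can be readily verified that

$$(x-y)^+ + (y-z)^+ \ge (x-z)^+.$$

Thus, we get

$$d_+(\mathbf{n},\mathbf{n}') + d_+(\mathbf{n}',\mathbf{n}'') = \sum_i (n_i - n'_i)^+ + \sum_i (n'_i - n''_i)^+ \ge \sum_i (n_i - n''_i)^+ = d_+(\mathbf{n},\mathbf{n}'').$$

Since $(n_i - n'_i)^+ \ge 0$, we get that $d_+(\mathbf{n},\mathbf{n}') = 0$ if and only if $n_i = n'_i$ for all $i = 0, \ldots, q-1$. ■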
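The metric properties claimed in Lemma 5.1 can be checked numerically on a small instance. The following is only an illustrative sketch: the function names `d_plus`, `compositions` and `check_metric` are hypothetical helpers introduced here, and the exhaustive check is feasible only for very small parameters.

```python
from itertools import product

def d_plus(n, n_prime):
    """d_+(n, n') = sum_i (n_i - n'_i)^+ for two compositions of the same integer."""
    assert len(n) == len(n_prime) and sum(n) == sum(n_prime)
    return sum(max(a - b, 0) for a, b in zip(n, n_prime))

def compositions(total, parts, max_part):
    """All ordered compositions of `total` into `parts` parts, each in 0..max_part."""
    return [c for c in product(range(max_part + 1), repeat=parts) if sum(c) == total]

def check_metric(total, parts, max_part):
    """Exhaustively verify symmetry, definiteness and the triangle inequality."""
    comps = compositions(total, parts, max_part)
    for a in comps:
        for b in comps:
            assert d_plus(a, b) == d_plus(b, a)
            assert (d_plus(a, b) == 0) == (a == b)
            for c in comps:
                assert d_plus(a, b) + d_plus(b, c) >= d_plus(a, c)
    return len(comps)
```

For example, `d_plus([1]*4 + [5]*4, [5]*4 + [1]*4)` evaluates the distance between the two compositions that appear in Example 5.5.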

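Let $\mathcal{N}(r,d) \subseteq \mathcal{N}(r)$ (resp. $\mathcal{N}(\le r,d) \subseteq \mathcal{N}(\le r)$) be such that for any distinct $\mathbf{n}, \mathbf{n}' \in \mathcal{N}(r,d)$ (resp. $\mathcal{N}(\le r,d)$) we have $d_+(\mathbf{n},\mathbf{n}') \ge d$. We can now readily give a lower bound on the size of symbol weight codes in terms of the CCCs:

$$A_q^{SW}(n,d,r) \ge \sum_{\mathbf{n} \in \mathcal{N}(r,d)} A_q(\mathbf{n},d), \qquad A_q^{SW}(n,d,\le r) \ge \sum_{\mathbf{n} \in \mathcal{N}(\le r,d)} A_q(\mathbf{n},d). \qquad (11)$$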



For large $n$ and $q$ the size of the set $\mathcal{N}(r)$ becomes very large. Hence finding all the compositions in $\mathcal{N}(r)$ which are separated by distance at least $d$ is difficult. We instead seek lower bounds on $|\mathcal{N}(r,d)|$ so that the size of the symbol weight codes can be more easily expressed in terms of the sizes of either one or a few CCCs.

Remark: Note that $d_+(\cdot,\cdot)$ is a metric on a "simplex" which intersects each axis at (Euclidean) distance $n$ from the origin. In particular, the components of $\mathbf{n}$ need not be restricted to integers for $d_+(\cdot,\cdot)$ to become a metric. Also, $n$ need not be restricted to be an integer.



A. Lower bounds on $|\mathcal{N}(r,d)|$

In this section we determine lower bounds on the size of $\mathcal{N}(r,d)$. To get these lower bounds, we first obtain a relation between the Hamming distance between two compositions and the distance between two compositions as given by (10).



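Lemma 5.2: For any two compositions $\mathbf{n}, \mathbf{n}' \in \mathcal{N}$, if $d_H(\mathbf{n},\mathbf{n}') = 2d$ then $d_+(\mathbf{n},\mathbf{n}') \ge d$.

Proof: Define two sets $I^+ = \{i : n_i > n'_i\}$ and $I^- = \{i : n_i < n'_i\}$. Clearly, in the rest of the coordinates, $n_i = n'_i$. Then we get the following set of equalities.

$$\sum_{i=0}^{q-1} n_i = \sum_{i=0}^{q-1} n'_i \;\Longrightarrow\; \sum_{i \in I^+} (n_i - n'_i) = \sum_{i \in I^-} (n'_i - n_i) \;\Longrightarrow\; \sum_{i \in I^+ \cup I^-} (n_i - n'_i)^+ = \sum_{i \in I^+ \cup I^-} (n'_i - n_i)^+. \qquad (12)$$

Note that the LHS and RHS of the last equation are both equal to $d_+(\mathbf{n},\mathbf{n}')$. Let $|I^+| = x$; then since $d_H(\mathbf{n},\mathbf{n}') = 2d$, we get $|I^-| = 2d - x$. Using the fact that $n_i - n'_i \ge 1$ for $i \in I^+$ and $n'_i - n_i \ge 1$ for $i \in I^-$, we get

$$d_+(\mathbf{n},\mathbf{n}') \ge \max\{x, 2d-x\} \ge \min_{1 \le x \le 2d-1} \max\{x, 2d-x\} = d. \; ■$$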
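The relation between the Hamming distance of two compositions and $d_+$ can be sanity-checked by brute force. The sketch below is an assumption-laden illustration (the helper names are hypothetical): it tests the slightly more general statement $2\,d_+ \ge d_H$ over all pairs of compositions of a small integer, which contains Lemma 5.2 as the case of even Hamming distance.

```python
from itertools import product

def d_plus(n, m):
    """d_+ between two compositions of the same integer."""
    return sum(max(a - b, 0) for a, b in zip(n, m))

def d_hamming(n, m):
    """Number of coordinates in which the two compositions differ."""
    return sum(a != b for a, b in zip(n, m))

def lemma_5_2_holds(total, parts):
    """Check 2 * d_+(n, n') >= d_H(n, n') for all compositions of `total` into `parts` parts."""
    comps = [c for c in product(range(total + 1), repeat=parts) if sum(c) == total]
    return all(2 * d_plus(a, b) >= d_hamming(a, b) for a in comps for b in comps)
```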



1) Lower bound from a permutation code on $S_r$: Let the alphabet set be $\{0, \ldots, r-1\}$, that is, $q = r$. In every word of length $n = r(r+1)/2$, let all the $r$ symbols occur such that the frequencies of the symbols are in the set $\{1, \ldots, r\}$ and all the frequencies occur. Because of this construction, given $r$ the values of $n, q$ are restricted as given above. Using the GV lower bound on the permutation code with Hamming distance at least $2d$ between two codewords gives us the lower bound

$$|\mathcal{N}(r,d)| \ge \frac{r!}{V(2d-1, S_r)}, \qquad (13)$$

where $V(2d-1, S_r)$ is the volume of the ball of radius $2d-1$ in $S_r$.
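The bound (13) is straightforward to evaluate once the ball volume in $S_r$ is known. The sketch below assumes the standard count of permutations at Hamming distance exactly $i$ from a fixed permutation, namely $\binom{r}{i}$ times the number of derangements of $i$ points; the function names are hypothetical, and rounding up to an integer is a convenience added here.

```python
from math import comb, factorial

def derangements(m):
    """Number of permutations of m points with no fixed point."""
    d = [1, 0]
    for i in range(2, m + 1):
        d.append((i - 1) * (d[i - 1] + d[i - 2]))
    return d[m]

def ball_volume(radius, r):
    """V(radius, S_r): permutations within Hamming distance `radius` of a fixed one."""
    return sum(comb(r, i) * derangements(i) for i in range(min(radius, r) + 1))

def gv_permutation_bound(r, d):
    """Smallest integer that is at least r! / V(2d - 1, S_r), as in (13)."""
    v = ball_volume(2 * d - 1, r)
    return -(-factorial(r) // v)
```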

2) Lower bound for general $q$: As explained in the proof of Theorem 4.3, a lower bound on constant symbol weight codes is provided by an Elias-type bound in the Hamming space. We can obtain another lower bound on constant symbol weight codes by considering lower bounds on $|\mathcal{N}(r,d)|$. A lower bound on $|\mathcal{N}(r,d)|$ is obtained by letting $k$ of the symbols $\{0, \ldots, q-1\}$ repeat $r$ times in every codeword of the constant symbol weight code, while the remaining $q-k$ symbols satisfy the condition that there are $(q-k)\left\{\frac{n-rk}{q-k}\right\}$ symbols with composition $l_1 = \left\lceil \frac{n-rk}{q-k} \right\rceil$ and $(q-k)\left(1 - \left\{\frac{n-rk}{q-k}\right\}\right)$ symbols with composition $l_0 = \left\lfloor \frac{n-rk}{q-k} \right\rfloor$, where $\{x\}$ denotes the fractional part of $x$. Denote this composition by $\mathbf{n} = \mathbf{n}(l_0, l_1, k, r)$, that is,

$$\mathbf{n}(l_0,l_1,k,r) = [\underbrace{r, \ldots, r}_{k},\; l_0, \ldots, l_0,\; l_1, \ldots, l_1].$$

Because of the above choice of the repetitions of each symbol we seek a "binary constant weight code" in $\mathcal{N}(r,d)$ where $q-k$ coordinates have the value $l_0$ or $l_1$ and the remaining $k$ coordinates have the value $r$. We also want the Hamming distance between distinct codewords to be at least $2d$. Denote the maximum size of a binary constant weight code of length $n$, weight $w$ and minimum distance $d$ by $A_2(n,d,w)$.

The GV bound under the above constraints is

$$|\mathcal{N}(r,d)| \ge A_2(q,2d,k) \ge \frac{\binom{q}{k}}{\sum_{i=0}^{d-1} \binom{k}{i}\binom{q-k}{i}}.$$

Note that the ball in the denominator has radius $2d-2$ (instead of $2d-1$) since the binary constant weight space affords only even distances. From Levenshtein [18] we know that the lower bound is significant, growing exponentially at a rate governed by some constant $T = T(q,k,d)$, only when the following conditions are satisfied




The above lower bounds on $|\mathcal{N}(r,d)|$ give lower bounds on symbol weight codes as follows.

Lemma 5.2 immediately allows us to use existing GV bounds in various spaces (under Hamming distance) to derive lower bounds on $|\mathcal{N}(r,d)|$.
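The GV-type estimate for binary constant weight codes used in this subsection can be evaluated with a few lines of code. This is a sketch under the assumption that the ball of radius $2d-2$ in the constant weight space of length $q$ and weight $k$ contains $\sum_{i=0}^{d-1}\binom{k}{i}\binom{q-k}{i}$ words; the function name is hypothetical and the result is rounded up to an integer.

```python
from math import comb

def gv_constant_weight(q, d, k):
    """GV-type lower bound on A_2(q, 2d, k): C(q, k) divided by the ball of radius 2d - 2."""
    ball = sum(comb(k, i) * comb(q - k, i) for i in range(d))
    return -(-comb(q, k) // ball)
```

For instance, `gv_constant_weight(8, 2, 4)` gives a guaranteed lower bound on $A_2(8,4,4)$, which can be compared against tabulated values of constant weight codes.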



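B. Lower bounds on codes

The lower bound on $A_q^{SW}(n,d,r)$ can be stated as follows.

Theorem 5.3: We get the following results for different compositions.

1) For $\mathbf{n} = [1, \ldots, r]$, we get

$$A_q^{SW}(n,d,r) \ge \frac{r!}{V(2d-1, S_r)}\, A_q(\mathbf{n},d).$$

2) For $\mathbf{n} = \mathbf{n}(l_0, l_1, k, r)$, we get

$$A_q^{SW}(n,d,r) \ge A_2(q,2d,k)\, A_q(\mathbf{n},d) \ge \frac{\binom{q}{k}}{\sum_{i=0}^{d-1}\binom{k}{i}\binom{q-k}{i}}\, A_q(\mathbf{n},d).$$

3) Let $k_1 \ge k_0 = \max\{n - (r-1)q, 1\}$, and $b = b(k_1) = \left\lfloor \frac{\lfloor n/r \rfloor - k_1}{2d} \right\rfloor$. Then

$$A_q^{SW}(n,d,r) \ge \max_{k_0 \le k_1 \le \lfloor n/r \rfloor} \sum_{i=0}^{b} A_2(q, 2d, k_1 + 2di) \times A_q(\mathbf{n}(l_0, l_1, k_1 + 2di, r), d). \qquad (14)$$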



Proof: The first two results follow immediately from the lower bounds on $|\mathcal{N}(r,d)|$. In part 1, each codeword in the permutation code corresponds to a rearrangement of the composition $\mathbf{n} = [1, \ldots, r]$. In part 2, each codeword in the binary constant weight code corresponds to a rearrangement of the composition $\mathbf{n} = \mathbf{n}(l_0, l_1, k, r)$.

For the third result, we include a larger range of CCCs. The expression is obtained by taking constant composition codes from separate constant composition spaces $\mathbf{n}(l_0, l_1, k_1 + 2di, r)$, whose compositions are separated by a Hamming distance of at least $2d$. Two different compositions $\mathbf{n}(l_0, l_1, k_1 + 2di, r)$ and $\mathbf{n}(l_0, l_1, k_1 + 2d(i+1), r)$ correspond to taking binary constant weight codes with weights separated by $2d$. Note that this choice of separate compositions corresponds to a binary bounded weight code, as studied in [3]. ■

There is a trade-off between the size of the constant composition space with composition $\mathbf{n}(l_0, l_1, k, r)$ and the size of the constant weight code in $\mathbb{Z}_2^q$. The size of the constant weight code in $\mathbb{Z}_2^q$ is substantial only for large $k$ around $q/2$. On the other hand, the size of the constant composition space is large for small $k$, thus potentially allowing for a larger CCC.

The lower bound in equation (14) is in fact useful in the case of a bounded symbol weight code with symbol weight at most $r$. It is unclear how to combine codes of different symbol weights $s$, where $\lceil n/q \rceil \le s \le r$, such that we can obtain a computable expression. We instead use equation (14) and optimize over the different symbol weights $s$ and the smallest weight $k_1$. Note that for a given symbol weight $s$, the quantity $k_1$ corresponds to the minimum number of symbols with frequency $s$ that we include in our estimate.



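Theorem 5.4:

$$A_q^{SW}(n,d,\le r) \ge \max_{\lceil n/q \rceil \le s \le r} A_q^{SW}(n,d,s) \ge \max_{\lceil n/q \rceil \le s \le r}\; \max_{k_1}\; \sum_{i=0}^{b(s)} A_2(q, 2d, k(i,s)) \times A_q(\mathbf{n}(l_0(i,s), l_1(i,s), k(i,s), s), d),$$

where $k_1 = k_1(s) \ge \max\{n - (s-1)q, 1\}$, $k = k(i,s) = k_1(s) + 2di$, $b(s) = \left\lfloor \frac{\lfloor n/s \rfloor - k_1}{2d} \right\rfloor$, $l_0(i,s) = \left\lfloor \frac{n-sk}{q-k} \right\rfloor$, and $l_1(i,s) = \left\lceil \frac{n-sk}{q-k} \right\rceil$.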



C. Numerical examples

We consider some numerical examples in order to show how the expressions in the previous section can be used to obtain lower bounds on symbol weight codes. We adopt the exponential notation of Chu et al. [8] to denote a composition in a compact form. The notation $n_0^{t_0} n_1^{t_1} \cdots$ is used to denote the composition in which the part $n_j$ is repeated $t_j$ times; for instance, $1^4 5^4$ denotes $[1,1,1,1,5,5,5,5]$.

We also recall the notion of a refinement of a composition from the same work. The composition $\mathbf{n} = [n_0, \ldots, n_{q-1}]$ is called a refinement of a composition $\mathbf{m} = [m_0, \ldots, m_{p-1}]$ if there is a partition $I_0, \ldots, I_{p-1}$ of $\{0, \ldots, q-1\}$ such that $\sum_{i \in I_j} n_i = m_j$, for every $j$. We write $\mathbf{n} \preceq \mathbf{m}$ if $\mathbf{n}$ is a refinement of $\mathbf{m}$. This notion is important because of the following inequality (see [8]):

$$A_q([n_0, \ldots, n_{q-1}], d) \ge A_p([m_0, \ldots, m_{p-1}], d). \qquad (15)$$

Below, we use lower bounds on FPAs, where the lower bound is taken from [8]. Using equation (15), lower bounds on CCCs are obtained from the lower bounds on the FPAs. The lower bounds on FPAs mentioned below rely on the existence of certain (generalized) distance preserving mappings into the permutation space $S_n$ (see [8]). The distances between compositions used in this section are all taken in the $d_+(\cdot,\cdot)$ metric, unless mentioned otherwise.



Example 5.5: In this example we show how Theorem 5.3 and equation (11) can be used. We know from [8, Example 3.7] that $A_4(6^4, 7) \ge 2^{12}$. Since $1^4 5^4 \preceq 6^4$, we immediately obtain that $A_8(1^4 5^4, 7) \ge 2^{12}$. In this case, $q = 8$ and $d = 7$ and the number of symbols occurring with frequency 5 is $k = 4$. Hence Theorem 5.3 is not applicable. But equation (11) can be applied directly. For instance the compositions $1^4 5^4$ and $5^4 1^4$ satisfy $d_+(1^4 5^4, 5^4 1^4) = 4(1-5)^+ + 4(5-1)^+ = 16$, which is greater than 7. Thus, $A_8^{SW}(24, 7, 5) \ge 2 \cdot 2^{12}$.

In fact, the compositions $\mathbf{n} = 1^4 5^4$ and $\mathbf{n}' = 5^4 1^4$ have the special property that if a symbol $i$ has different compositions $n_i, n'_i$ then $|n_i - n'_i| = 4$. We can exploit this property to get a variant of Lemma 5.2 below.



Lemma 5.6: For two compositions $\mathbf{n}, \mathbf{n}' \in \mathcal{N}$ let $d_H(\mathbf{n},\mathbf{n}') = D$. If, for every $i = 0, \ldots, q-1$, either $|n_i - n'_i| \ge a > 0$ or it is zero, then $d_+(\mathbf{n},\mathbf{n}') \ge Da/2$.

Proof: The proof is very similar to the proof of Lemma 5.2. Let $I^+, I^-$ be as defined in the proof of that lemma, and let $|I^+| = x$, $|I^-| = D - x$. Finally, use equation (12) to get

$$d_+(\mathbf{n},\mathbf{n}') \ge a \max\{x, D-x\} \ge a \min_{1 \le x \le D-1} \max\{x, D-x\} \ge Da/2. \; ■$$
Example 5.5 (contd.): Returning to this example, we know that if two compositions differ in a symbol, then the difference is 4, and so $a = 4$. We will have the distance between two compositions at least 7 if we ensure (using Lemma 5.6) that $Da/2 \ge 7$, that is, $D \ge \lceil 14/4 \rceil = 4$. Using $q = 8$, $D = 4$, $k = 4$, we get that $A_8^{SW}(24,7,5) \ge A_2(8,4,4)\, A_8(1^4 5^4, 7)$. The size of the binary code is obtained from Agrell et al. [1]: $A_2(8,4,4) = 14$. This gives the much improved lower bound $A_8^{SW}(24,7,5) \ge 14 \cdot 2^{12}$.
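The arithmetic of Example 5.5 can be replayed in a few lines. The sketch below only re-derives the quantities quoted in the example; the values $A_2(8,4,4) = 14$ and the CCC bound $2^{12}$ are taken from the text as given, and the helper names are hypothetical.

```python
def d_plus(n, m):
    """d_+ between two compositions of the same integer."""
    return sum(max(a - b, 0) for a, b in zip(n, m))

def min_hamming_distance(d, a):
    """Smallest integer D with D * a / 2 >= d, as required by Lemma 5.6."""
    return -(-2 * d // a)

n1 = [1] * 4 + [5] * 4          # composition 1^4 5^4
n2 = [5] * 4 + [1] * 4          # composition 5^4 1^4
ccc_bound = 2 ** 12             # lower bound on A_8(1^4 5^4, 7) quoted in the example
direct_bound = 2 * ccc_bound    # two compositions, via equation (11)
improved_bound = 14 * ccc_bound # A_2(8, 4, 4) = 14 compositions, via Lemma 5.6
```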

Example 5.7: We continue with the previous example and a finer refinement $1^8 2^8 \preceq 6^4$. We show in this example that directly using equation (11) can lead to a better bound, compared to Theorem 5.3. In this case, we apply Theorem 5.3 with $k = 8$, $2d = 14$, $q = 16$, and $\mathbf{n}(l_0, l_1, k, r) = 1^8 2^8$. From the table of constant weight codes in [1], we get $A_2(16, 14, 8) = 2$. This gives $A_{16}^{SW}(24,7,2) \ge 2 \cdot 2^{12}$. However, this bound can be improved by using equation (11).
A greedy search through the compositions with the maximum 
value of each part being 2, show that the compositions 
03^2311^ Ii2403li2^ Ii2803li23, 2ili23li23li2302li, and 
2^01 2'*0i 2^01 2^01 are mutually at distance at least 7 from 
one another. Since each of them is a refinement of $6^4$ we get that $A_{16}^{SW}(24,7,2) \ge 5 \cdot 2^{12}$.
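A greedy search of the kind mentioned in this example can be sketched as follows. This is not the authors' search procedure: the enumeration order (lexicographic), the function names, and the pruning rule are all assumptions of this illustration, and a different order will in general select a different set of compositions.

```python
def d_plus(a, b):
    """d_+ between two compositions of the same integer."""
    return sum(max(x - y, 0) for x, y in zip(a, b))

def bounded_compositions(total, parts, max_part):
    """Yield ordered compositions of `total` into `parts` parts with entries 0..max_part."""
    if parts == 0:
        if total == 0:
            yield ()
        return
    for first in range(min(max_part, total) + 1):
        # Skip branches where the remaining parts cannot reach the remaining total.
        if total - first <= max_part * (parts - 1):
            for rest in bounded_compositions(total - first, parts - 1, max_part):
                yield (first,) + rest

def greedy_separated(total, parts, max_part, d):
    """Greedily keep compositions that are at d_+ distance >= d from all kept ones."""
    chosen = []
    for c in bounded_compositions(total, parts, max_part):
        if all(d_plus(c, other) >= d for other in chosen):
            chosen.append(c)
    return chosen
```

Calling `greedy_separated(24, 16, 2, 7)` runs the search for the parameters of this example; it walks through all compositions once, so it may take a little while in pure Python.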

Finally, we look at an example which considers the bounded symbol weight and demonstrates the use of Theorem 5.4.

Example 5.8: Consider the refinement $3^8 \preceq 6^4$. We can consider a code in this space to be embedded in the constant composition space with 16 symbols. Thus, we get $A_{16}(3^8 0^8, 7) \ge 2^{12}$. We use the fact that if we consider all compositions containing 8 symbols occurring with frequency 3, then the difference of frequency between two symbols from different compositions is either 0 or 3. Thus, $a = 3$, $k = 8$, and $q = 16$. Using Lemma 5.6 we need to ensure that $Da/2 \ge 7 \Rightarrow D \ge 5$. We get the lower bound $A_{16}^{SW}(24,7,3) \ge A_2(16,5,8)\, A_{16}(3^8 0^8, 7)$. From [1] we have $A_2(16,5,8) = A_2(16,6,8) \ge 120$. Hence, $A_{16}^{SW}(24,7,\le 3) \ge \max\{A_{16}^{SW}(24,7,3), A_{16}^{SW}(24,7,2)\} \ge 120 \cdot 2^{12}$. In this particular example, using the lower bound in equation (14) on the constant symbol weight codes with $k_1 = 1$ does not yield a better bound, primarily due to the absence of a known good lower bound on the corresponding CCC. The improvement mainly stems from the fact that we use a large binary constant weight code with weight $k_1 = q/2$.



As is evident from the above examples, Theorems 5.3 and 5.4 help in the actual computation of the bounds. In Example 5.7 the number of ordered partitions of $n = 24$ into $q = 16$ parts, with each part taking values between 0 and 2, inclusive, is $|P(24, 16, 2)| = 258570$. A non-exhaustive greedy search could only find 5 compositions. It is computationally difficult to search exhaustively in such a large space. Similarly, for larger lengths and alphabet sizes, finding the size of, and compositions in, $\mathcal{N}(r,d)$ is difficult. Instead, by relying on known bounds on binary constant weight codes and CCCs, we compute lower bounds on the sizes of the symbol weight codes more easily.
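The count of bounded compositions quoted above can be reproduced with a short dynamic program. The sketch below is an illustration with a hypothetical function name; it counts ordered compositions of `total` into a fixed number of parts, each between 0 and `max_part`.

```python
def count_bounded_compositions(total, parts, max_part):
    """Number of ordered compositions of `total` into `parts` parts, each in 0..max_part."""
    ways = [1] + [0] * total  # ways[s]: compositions of s using the parts processed so far
    for _ in range(parts):
        new = [0] * (total + 1)
        for s in range(total + 1):
            for v in range(min(max_part, s) + 1):
                new[s] += ways[s - v]
        ways = new
    return ways[total]
```

Evaluating `count_bounded_compositions(24, 16, 2)` gives the size of the search space for Example 5.7.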



VI. Constructions of symbol weight codes 

In this section we determine constructions of symbol weight codes. Versfeld et al. [29], [30] provided constructions of bounded symbol weight codes from Reed-Solomon codes. We seek to obtain constant symbol weight codes with positive rate and positive relative distance. The following two constructions provide us with such codes, given that we already have a constant symbol weight code with positive rate and positive relative distance.

$u|v$ construction: Let $\mathcal{C}$ be a constant symbol weight code with parameters $\mathcal{C}(n, M, d, r)_q$ over $\mathbb{Z}_q$. Let $\mathcal{C}'$ be an FPA over $\mathbb{Z}_q$ with parameters $\mathcal{C}'(r'q, M', d')$. Then the $u|v$ construction results in a code $\mathcal{D}$. It is obtained by taking all codewords as follows:

$$\mathcal{D} = \{(u,v) : u \in \mathcal{C}, v \in \mathcal{C}'\}.$$

The code $\mathcal{D}$ has parameters $\mathcal{D}(n + r'q, MM', \min\{d, d'\}, r + r')_q$ over $\mathbb{Z}_q$. In particular if the code $\mathcal{C}$ had the minimum symbol weight $r = \lceil n/q \rceil$ then so does the code $\mathcal{D}$, that is, $r + r' = \lceil (n + r'q)/q \rceil$.
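A toy instance makes the parameters of the $u|v$ construction concrete. The codes chosen below are illustrative assumptions, not taken from the paper: the constant symbol weight code is the set of all permutations of $\{0,1,2\}$ (so $n = 3$, $M = 6$, $d = 2$, $r = 1$) and the FPA is the set of cyclic shifts of $(0,1,2)$ (so $r' = 1$, $M' = 3$, $d' = 3$).

```python
from collections import Counter
from itertools import permutations, product

def symbol_weight(word):
    """Largest number of times any symbol occurs in the word."""
    return max(Counter(word).values())

def min_distance(code):
    """Minimum Hamming distance over all pairs of distinct codewords."""
    return min(sum(a != b for a, b in zip(u, v))
               for i, u in enumerate(code) for v in code[i + 1:])

def uv_construction(code, fpa):
    """All concatenations (u, v) with u in the first code and v in the second."""
    return [u + v for u, v in product(code, fpa)]

C = list(permutations(range(3)))                 # n = 3, M = 6, d = 2, r = 1 over Z_3
C_prime = [(0, 1, 2), (1, 2, 0), (2, 0, 1)]      # FPA with r' = 1, M' = 3, d' = 3
D = uv_construction(C, C_prime)                  # expected parameters (6, 18, 2, 2)_3
```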

Concatenated construction: Let $\mathcal{C}$ be a code over $\mathbb{Z}_q$ with parameters $\mathcal{C}(n, M, d)_q$. Let $\mathcal{C}'$ be an FPA with parameters $\mathcal{C}'(rp, M', d')_p$ over $\mathbb{Z}_p$, such that $M' \ge q$. The concatenated code $\mathcal{D}$ with $\mathcal{C}'$ as the inner code and $\mathcal{C}$ as the outer code has parameters $\mathcal{D}(nrp, M, dd', rn)_p$. It is obtained by replacing every $q$-ary symbol of $\mathcal{C}$ with a codeword from $\mathcal{C}'$. In particular, the resulting code $\mathcal{D}$ has the minimum symbol weight $nrp/p = nr$.
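The concatenated construction can likewise be illustrated on a very small example. The outer and inner codes below are assumptions chosen only for this sketch: the outer code is all binary words of length 2 ($n = 2$, $M = 4$, $d = 1$, $q = 2$) and the inner FPA is $\{(0,1), (1,0)\}$ over $\mathbb{Z}_2$ ($r = 1$, $p = 2$, $M' = 2$, $d' = 2$).

```python
from collections import Counter
from itertools import product

def symbol_weight(word):
    """Largest number of times any symbol occurs in the word."""
    return max(Counter(word).values())

def min_distance(code):
    """Minimum Hamming distance over all pairs of distinct codewords."""
    return min(sum(a != b for a, b in zip(u, v))
               for i, u in enumerate(code) for v in code[i + 1:])

def concatenate(outer, inner_map):
    """Replace every outer symbol by the inner codeword assigned to it."""
    return [tuple(s for sym in word for s in inner_map[sym]) for word in outer]

outer = list(product(range(2), repeat=2))     # C(n = 2, M = 4, d = 1)_2
inner = [(0, 1), (1, 0)]                      # FPA C'(rp = 2, M' = 2, d' = 2)_2 with r = 1
inner_map = dict(zip(range(2), inner))        # one inner codeword per outer symbol
D = concatenate(outer, inner_map)             # expected parameters (4, 4, 2, 2)_2
```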

A. Constructions from Reed-Solomon codes 

In this section, we use the symbol $k$ to denote the dimension of the Reed-Solomon code. Let $\mathcal{C}[n,k,d]_q$ be a Reed-Solomon code over a finite field $\mathbb{F}_q$ with $d = n-k+1$ and $n = q-1$. Versfeld et al. [29], [30] showed that aside from the Reed-Solomon codewords which correspond to a constant polynomial, the Reed-Solomon code has maximum symbol weight of $n-d = k-1$. It is also established in the same works that there exists a coset of the Reed-Solomon code such that the maximum symbol weight of any codeword in the code is at most $n-d+1$. These codes belong to the bounded symbol weight space $SW(n, q, \le r)$. By the Singleton bound, these are optimal codes.

In this section we establish several results which show that subsets of Reed-Solomon codes or their cosets can achieve the GV-type lower bound in Theorem 4.3. First, Lemma 6.1 below shows that for any constant symbol weight $r$, there exists a coset of the Reed-Solomon code that attains the GV bound asymptotically. In Theorem 6.4 we provide a more explicit description of a subset of the Reed-Solomon code itself that has the constant symbol weight $r$ for $r > n/2$, such that it attains the GV bound asymptotically. Since $k-1 \ge r$, this also means that this latter result holds only for Reed-Solomon codes with rate more than 1/2.

We first show by an averaging argument that the GV bound 
can be achieved by subcodes of cosets of Reed-Solomon 
codes. 

Lemma 6.1: Let $\mathcal{C}[n,k,d]_q$ be a family of Reed-Solomon codes. For $n \to \infty$ let $r/n \to \rho$, and $d/n \to \delta$. Then there exists a family of subcodes $\mathcal{C}'$ which is a subset of some coset of the code $\mathcal{C}$ such that

$$\liminf_{n \to \infty} \frac{1}{n} \log_q |\mathcal{C}'| \ge 1 - \rho - \delta.$$

Proof: Let $\mathcal{C}_1, \ldots, \mathcal{C}_{q^{n-k}}$ denote the cosets of the Reed-Solomon code. Since the cosets of the Reed-Solomon code are disjoint and they partition the Hamming space, we have

$$\sum_{i=1}^{q^{n-k}} |\mathcal{C}_i \cap SW(n,q,r)| = |SW(n,q,r)|.$$

Thus, the average size of the intersection of a coset with the space $SW(n,q,r)$ is $|SW(n,q,r)|/q^{n-k}$. Hence, there exists at least one coset whose intersection with $SW(n,q,r)$ has size at least this average. In the asymptotics for $n, q \to \infty$, we get the result stated in the Lemma. ■

In the remaining part of this section, we give a more explicit description of a subcode of the Reed-Solomon code with rate equal to the GV-type bound. The derivation of this result uses a lemma and a proposition stated below. Proposition 6.2 below states that asymptotically the rate of the constant symbol weight code with symbol weight $r$ that is a subset of the Reed-Solomon code can not exceed the rate of the subcode formed by all the codewords of weight $n-r$. Lemma 6.3 gives an upper bound on the number of codewords of weight $n-r$. The combination of this proposition and the lemma implies that the rate of the constant symbol weight code, which is a subset of the Reed-Solomon code, with symbol weight $r = \rho n$, can not exceed the GV-type lower bound $1-\rho-\delta$ that we obtained in Theorem 4.3, for any $\rho$, $0 < \rho < 1$. Theorem 6.4 below shows that this rate can be attained for any $\rho$ satisfying $1/2 < \rho < 1$. We state the proposition and the lemma first, and defer their proofs to after the proof of the theorem.

Proposition 6.2: Let $\mathcal{C}[n,k,d]_q$ be a family of Reed-Solomon codes with parameters $n = q-1$, $d = n-k+1$. Let $S(r)$ denote the set of vectors with symbol weight exactly $r$, for $1 \le r \le k-1$, and let $B_{n-r}$ denote the number of vectors of weight $n-r$. Then,

$$|S(r)| \le q(q-1) B_{n-r}.$$

Lemma 6.3: The weight distribution $\{B_w : w = d, \ldots, n\}$ of a linear Maximum Distance Separable (MDS) code with parameters $[n,k,d]_q$ satisfies:

$$B_{n-r} \le \binom{n}{r}\left(q^{k-r} - 1\right)$$

for $0 \le r \le k-1$.



The main theorem in this section is now stated below.

Theorem 6.4: Let $\mathcal{C}[n,k,d]_q$ denote the family of Reed-Solomon codes with $n = q-1$ and $d = n-k+1$. Let $k-1 \ge r > n/2$. For $n \to \infty$, let $r/n \to \rho$ and $d/n \to \delta$. There exists a family of subcodes $\mathcal{C}'$ of $\mathcal{C}$ of symbol weight exactly $r$ such that

$$\lim_{n \to \infty} \frac{1}{n} \log_q |\mathcal{C}'| = 1 - \rho - \delta.$$

Proof: Every codeword of the Reed-Solomon code consists of coordinates which are the evaluations at all the non-zero points of $\mathbb{F}_q$ of a polynomial of degree at most $k-1$. Let $f(x) = f_0 + f_1 x + \cdots + f_{k-1} x^{k-1}$ be a polynomial in $\mathbb{F}_q[x]$. Let $\mathbb{F}_q^* = \mathbb{F}_q \setminus \{0\}$. If $f(x)$ has symbol weight $r$ then it implies that $f(x) = a$ for some $a \in \mathbb{F}_q$ and for $r$ different values of $x$ in $\mathbb{F}_q^*$. In other words, $f(x) - a$ has exactly $r$ distinct roots. Note that $r$ is restricted to be $r \le k-1$ since the polynomials can not have more than $k-1$ roots.

Let $f(x)$ be a polynomial of degree $k-1$ such that it has exactly $r$ nonzero distinct roots $a_1, \ldots, a_r$ in $\mathbb{F}_q^*$. Then $f(x)$ can be written as

$$f(x) = \beta (x - a_1) \times \cdots \times (x - a_r) \times g(x),$$

where $\beta \in \mathbb{F}_q^*$ and $g(x)$ is a product of monic irreducible polynomials, each of degree at least 2. The total degree of $g(x)$ is $k-1-r$. Since $r > n/2$, the polynomial $f(x)$ can not attain the value $a$, where $a \in \mathbb{F}_q^*$, at more than $r$ different points $x \in \mathbb{F}_q^*$ since there are $q-1-r < n/2$ points at which the function is nonzero. Hence, the symbol weight of the codeword represented by $f(x)$ is exactly $r$. We seek the asymptotic exponent of the number of such polynomials $f(x)$. This number is dominated by the number of possible monic irreducible polynomials $g(x)$. To describe this number, we recall the definition of the Möbius function $\mu(t)$,

$$\mu(t) = \begin{cases} 1 & \text{if } t = 1, \\ (-1)^s & \text{if } t \text{ is a product of } s \text{ distinct primes}, \\ 0 & \text{if } p^2 \mid t \text{ for some prime } p. \end{cases}$$

The number of monic irreducible polynomials of degree $t$ is given by the sum $\frac{1}{t} \sum_{s \mid t} \mu(s) q^{t/s}$ (see Lidl and Niederreiter [20, Theorem 3.25]). In particular, for large $t$, this sum is dominated by just the first term $\frac{1}{t}\mu(1) q^t$. As a consequence, asymptotically the number of polynomials $f(x)$ is described by the number of monic irreducible polynomials $g(x)$ of degree $k-1-r$. The asymptotic exponent of this count is approximately $\lim_{n \to \infty} (k-1-r)/n = 1 - \delta - \rho$. Thus the subset $\mathcal{C}'$ of $\mathcal{C}$ consists of all the codewords obtained from at least these polynomials. Hence the count above provides a lower bound on the rate of $\mathcal{C}'$:

$$\liminf_{n \to \infty} \frac{1}{n} \log_q |\mathcal{C}'| \ge 1 - \delta - \rho.$$

The upper bound on the rate of $\mathcal{C}'$ is obtained by applying both Lemma 6.3 and Proposition 6.2. We get

$$|\mathcal{C}'| \le |S(r)| \le q(q-1) B_{n-r} \le q(q-1) \binom{n}{n-r} \left(q^{k-r} - 1\right),$$

and,

$$\limsup_{n \to \infty} \frac{1}{n} \log_q |S(r)| \le \lim_{n \to \infty} \frac{k-r}{n} = 1 - \delta - \rho. \; ■$$
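The count of monic irreducible polynomials used in the proof, and the claim that its leading term dominates, can be checked numerically. The sketch below assumes the standard formula $\frac{1}{t}\sum_{s \mid t}\mu(s)q^{t/s}$; the function names are hypothetical.

```python
def mobius(t):
    """Moebius function: 0 if t has a squared prime factor, else (-1)^(number of primes)."""
    result, p = 1, 2
    while p * p <= t:
        if t % p == 0:
            t //= p
            if t % p == 0:
                return 0
            result = -result
        p += 1
    if t > 1:
        result = -result
    return result

def num_monic_irreducible(q, t):
    """Number of monic irreducible polynomials of degree t over a field with q elements."""
    total = sum(mobius(s) * q ** (t // s) for s in range(1, t + 1) if t % s == 0)
    return total // t

def leading_term_ratio(q, t):
    """Ratio of the exact count to the leading term q^t / t."""
    return t * num_monic_irreducible(q, t) / q ** t
```

For growing `t` or `q` the value of `leading_term_ratio(q, t)` approaches 1, which is the sense in which the first term dominates.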



Proof of Proposition 6.2: Let $c = (c_1, \ldots, c_n)$ be a codeword in the Reed-Solomon code. Then $c$ is the image of a polynomial $c(x)$ evaluated at all points of $\mathbb{F}_q^* = \mathbb{F}_q \setminus \{0\}$. We write $c = (c(x))_{x \in \mathbb{F}_q^*}$. If $c$ resulting from $c(x)$ has symbol weight exactly $r$ then so do the codewords obtained from the polynomials $\gamma c(x) + \beta$, for $\gamma \in \mathbb{F}_q^*$, $\beta \in \mathbb{F}_q$. Consider the subset $S'(r)$ of $S(r)$ that is obtained by retaining exactly one monic polynomial from the set $\{\gamma c(x) + \beta : \gamma \in \mathbb{F}_q^*, \beta \in \mathbb{F}_q\}$ for any polynomial $c(x)$. Thus, the size of $S'(r)$ satisfies $|S'(r)| = |S(r)|/(q(q-1))$.

We claim that $|S'(r)| \le B_{n-r}$. To show this, we claim that there exists an injective mapping from $S'(r)$ to the set of all vectors of weight $n-r$. Since any $c(x)$ in $S'(r)$ has symbol weight exactly $r$, there exists a $\beta \in \mathbb{F}_q$ such that $c(x) - \beta$ has exactly $r$ distinct roots. Thus, the codeword $(c(x) - \beta)_{x \in \mathbb{F}_q^*}$ has Hamming weight exactly $n-r$. This is the only such vector. If there exists $e(x) \in S'(r)$ and $\alpha \in \mathbb{F}_q$ such that $(e(x) - \alpha)_{x \in \mathbb{F}_q^*} = (c(x) - \beta)_{x \in \mathbb{F}_q^*}$, then the two polynomials $c(x)$ and $e(x)$ must satisfy the relation $c(x) - \beta = e(x) - \alpha$ since they are the same on $n = q-1$ points and their degrees are at most $k-1 < n$. Thus, $c(x) = e(x) - \alpha + \beta$, which is not possible since $S'(r)$ contains exactly one polynomial of this form. ■

Proof of Lemma 6.3: The expression for the weight distribution satisfies (see [24, Chapter 11]):

$$B_{n-r} = \binom{n}{n-r} \sum_{j=0}^{k-r-1} (-1)^j \binom{n-r}{j} \left(q^{k-r-j} - 1\right).$$

Retaining only the first term gives the required upper bound on $B_{n-r}$. The above expression can be rewritten as



$$B_{n-r} = \binom{n}{r}\left(q^{k-r} - 1\right) - \binom{n}{r} \sum_{i=1}^{\lfloor (k-r-1)/2 \rfloor} \left[ \binom{n-r}{2i-1}\left(q^{k-r-(2i-1)} - 1\right) - \binom{n-r}{2i}\left(q^{k-r-2i} - 1\right) \right] - I(2 \nmid k-r-1) \binom{n}{r}\binom{n-r}{k-r-1}(q-1),$$

where $I(2 \nmid k-r-1)$ is an indicator function that is 1 if 2 does not divide $k-r-1$ and 0 otherwise. We show that each of the terms in the summation above is positive and hence we can upper bound $B_{n-r}$ by only the first term. This is proved via the following sequence of inequalities. For any $j$, $j = 0, \ldots, k-r-1$, we have

$$\binom{n-r}{j}\left(q^{k-r-j} - 1\right) \ge \binom{n-r}{j+1}\left(q^{k-r-j-1} - 1\right) \iff q \ge \frac{n-r-j}{j+1} \cdot \frac{1 - q^{-(k-r-j-1)}}{1 - q^{-(k-r-j)}}.$$

We use the inequality $q \ge (n-r)/(1-q^{-1})$. This expression is greater than the RHS of the above because of the inequalities $1 - q^{-1} \le 1 - q^{-(k-r-j)}$, $1 \ge 1 - q^{-(k-r-j-1)}$, and $n-r \ge (n-r-j)/(j+1)$. This proves the lemma. ■
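The weight distribution of an MDS code and the first-term bound can be evaluated directly. The sketch below assumes the standard MDS weight enumerator formula $B_w = \binom{n}{w}\sum_{j=0}^{w-d}(-1)^j\binom{w}{j}(q^{w-d+1-j}-1)$ and writes the first-term bound as $\binom{n}{r}(q^{k-r}-1)$; the function names are hypothetical.

```python
from math import comb

def mds_weight_distribution(n, k, q):
    """Weight distribution {w: B_w} of an [n, k, n-k+1] MDS code over a field of size q."""
    d = n - k + 1
    B = {0: 1}
    for w in range(d, n + 1):
        B[w] = comb(n, w) * sum((-1) ** j * comb(w, j) * (q ** (w - d + 1 - j) - 1)
                                for j in range(w - d + 1))
    return B

def first_term_bound(n, k, q, r):
    """Upper bound on B_{n-r} obtained by keeping only the j = 0 term."""
    return comb(n, r) * (q ** (k - r) - 1)
```

As a usage example, `mds_weight_distribution(4, 3, 5)` describes a Reed-Solomon code of length $q - 1 = 4$ over $\mathbb{F}_5$, and each $B_{n-r}$ can be compared with `first_term_bound(4, 3, 5, r)`.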



B. Discussion 

For correcting narrowband noise in the powerline channel, it is desirable that the symbol weight be close to the minimum possible value of $\lceil n/q \rceil$. It remains open to determine a large subset of the Reed-Solomon code for the case $\rho < 1/2$. It follows from Proposition 6.2 that the rate of this subset can not exceed the GV-type bound $1 - \rho - \delta$. The more interesting question is whether this bound can be achieved, especially in the cases where $r$ is small. The work of Konyagin and Pappalardi [17] gives an affirmative answer to this question in the case of $r = 1$, and for low relative distance $\delta$. They show that the number of permutation polynomials of degree at most $q-1-d$ is approximately $q!/q^d$ for $d < 0.03983q$. This is asymptotically $\frac{1}{n}\log_q(q!/q^d) \approx 1 - \delta$, where $d = \delta n$.

For the other ranges of $r$, when $r$ is growing with $n$, we believe that it should be possible to attain the GV bound. For instance, it would be interesting to prove that for large $r$, $r > (k-1)/2$, and for any irreducible polynomial $g(x)$ of degree $k-r-1$, there exist $r$ distinct and nonzero points $a_1, \ldots, a_r$ in $\mathbb{F}_q^*$ such that the polynomial $(x - a_1) \cdots (x - a_r) g(x)$ has symbol weight exactly $r$. Since most of the count of polynomials attaining constant symbol weight $r$ comes from the count of irreducible polynomials, proving this will show that the rate attains the GV-type bound. We have obtained no counterexample on performing an exhaustive computer search over all such irreducible polynomials $g(x)$ of degree $k-1-r$, with $(k-1)/2 < r < k < n$, in all finite fields up to $\mathbb{F}_{17}$. We are unable to verify for larger fields because the computations become prohibitive. We believe that the conjecture is not true for very small $r$ (when $r$ does not grow with $n$). For instance
for $r = 1$ the polynomial $(x-a)g(x)$, where $g(x) = x^2 + 2$ is an irreducible polynomial in $\mathbb{F}_7$, has symbol weight greater than 1 for every choice of $a \in \mathbb{F}_7$.
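A claim of this kind over $\mathbb{F}_7$ is small enough to check exhaustively. The sketch below takes the irreducible polynomial to be the quadratic $g(x) = x^2 + 2$; this choice of $g$ is an assumption of the illustration, as are the function names. It evaluates $(x-a)g(x)$ at the six nonzero points for every $a$ and records the symbol weight.

```python
from collections import Counter

Q = 7  # field size; arithmetic is modulo the prime 7

def g(x):
    """The assumed quadratic g(x) = x^2 + 2 over F_7."""
    return (x * x + 2) % Q

def symbol_weight(a):
    """Symbol weight of the evaluations of (x - a) g(x) at the nonzero points of F_7."""
    values = [((x - a) * g(x)) % Q for x in range(1, Q)]
    return max(Counter(values).values())

weights = {a: symbol_weight(a) for a in range(Q)}
```

The dictionary `weights` then lists the symbol weight obtained for each choice of $a$.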

VII. Conclusion 

We derived asymptotic estimates of the sizes of symbol weight codes. We also provided means of obtaining lower bounds on such codes and showed that it is possible to provide symbol weight codes with the minimal possible symbol weight via recursive constructions, given that we start with a known such code. Finally, we provided constructions of asymptotically good constant symbol weight codes. It remains open to determine families of codes with positive rate and positive relative distance with symbol weights that are optimal or close to the optimal value of $\lceil n/q \rceil$.

Acknowledgement 

We thank the Associate Editor Navin Kashyap for pointing 
out an error in an earlier version of the manuscript, and for 
his comments, which helped us improve the presentation of 
this article. 

References 

[1] E. Agrell, A. Vardy, and K. Zeger, "Upper bounds for constant-weight codes", IEEE Trans. Inf. Theory, vol. 46, pp. 2373-2395, November 2000.

[2] M. Aaltonen, "A new upper bound on nonbinary block codes", Discrete Math., vol. 83, no. 2-3, pp. 139-160, 1990.

[3] C. Bachoc, V. Chandar, G. Cohen, P. Sole, and A. Tchamkerten, "On bounded weight codes", IEEE Trans. Inf. Theory, vol. 57, pp. 6780-6787, October 2011.






[4] I. F. Blake, G. Cohen, and M. Deza, "Coding with permutations", Inf. 

and Control, vol. 43, pp. 1-19, 1979. 
[5] Y. M. Chee, G. Ge, and A. C. H. Ling, "Group divisible codes and their 

application in the construction of optimal constant-composition codes of 

weight three", IEEE Trans. Inf. Theory, vol. 54, pp. 3552-3564, August 

2008. 

[6] Y. M. Chee, S. H. Dau, A. C. H. Ling, and S. Ling, "Linear size 
optimal q-ary constant-weight codes and constant-composition codes", 
IEEE Trans. Inf. Theory, vol. 56, pp. 140-151, January 2010. 

[7] W. Chu, C. J. Colboum, P. Dukes, "Constructions for permutation codes 
in powerline communications", Des. Codes Cryptogn, vol. 32, pp. 51- 
64, 2004. 

[8] W. Chu, C. J. Colboum, P. Dukes, "On constant composition codes". 
Discrete Appl Math., vol. 154, no. 6, pp. 912-929, 2006. 

[9] C. J. Colboum, T. Kl0ve, A. C. H. Ling, "Permutation arrays for 
powerline communication and mutually orthogonal Latin squares", IEEE 
Trans. Inf Theory, vol. 50, pp. 1289-1291, June 2004. 
[10] C. Ding and J. Yin, "Combinatorial constructions of optimal constant- 
composition codes", IEEE Trans. Inf. Theory, vol. 51, pp. 3671—3675, 
October 2005. 

[11] C. Ding and J. Yin, "A construction of optimal constant composition
codes", Des. Codes Cryptogr., vol. 40, pp. 157-165, 2006.

[12] P. J. Dukes, "Coding with injections", Des. Codes Cryptogr., published
online 03 August 2011, DOI 10.1007/s10623-011-9547-4, 2011.

[13] I. M. Gessel and R. P. Stanley, "Algebraic Enumeration", Handbook of
Combinatorics, vol. 2, Elsevier, Amsterdam, 1995, pp. 1021-1061.

[14] S. Huczynska, "Powerline communication and the 36 officers problem",
Phil. Trans. R. Soc. A, vol. 364, pp. 3199-3214, 2006.

[15] S. Huczynska and G. L. Mullen, "Frequency permutation arrays",
J. Combinatorial Des., vol. 14, pp. 463-478, 2006.

[16] S. Huczynska, "Equidistant frequency permutation arrays and related
constant composition codes", Des. Codes Cryptogr., vol. 54, pp. 109-120,
2010.

[17] S. Konyagin and F. Pappalardi, "Enumerating permutation polynomials
over finite fields by degree II", Finite Fields and their Applications,
vol. 12, pp. 26-37, 2006.

[18] V. I. Levenshtein, "Upper bound estimates for fixed weight codes",
Problemy Peredachi Informacii, vol. 7, no. 4, pp. 3-12, 1970.

[19] V. I. Levenshtein, "Methods for obtaining bounds in metric problems of
coding theory", Proc. 1975 IEEE-USSR Joint Workshop on Inf. Theory,
pp. 126-143, 1976.

[20] R. Lidl and H. Niederreiter, "Finite Fields", in Encyclopedia of
Mathematics and its Applications, vol. 20, 2nd ed., Cambridge University
Press, 1997.

[21] J. Lin, J. Chang, R. Chen, and T. Kløve, "Distance-preserving and
distance-increasing mappings from ternary vectors to permutations", IEEE
Trans. Inf. Theory, vol. 54, pp. 1334-1339, March 2008.

[22] Y. Luo, F. Fu, A. J. H. Vinck, and W. Chen, "On constant-composition
codes over Z_q", IEEE Trans. Inf. Theory, vol. 49, pp. 3010-3016,
November 2003.

[23] J. Luo and T. Helleseth, "Constant composition codes as subcodes 
of cyclic codes", IEEE Trans. Inf. Theory, vol. 57, pp. 7482-7488, 
November 2011. 

[24] F. J. MacWilliams and N. J. A. Sloane, The Theory of Error-Correcting
Codes, North-Holland, Amsterdam, 1991.

[25] A. W. Marshall, B. C. Arnold, and I. Olkin, Inequalities: Theory of
Majorization and Its Applications, 2nd ed., Springer, 2011.

[26] R. Omrani and P. V. Kumar, "Improved constructions and bounds for 2-D
optical orthogonal codes", Proc. Int. Symp. Inf. Theory 2005, Adelaide,
Australia, 2005, pp. 127-131.

[27] C. E. Shannon, R. G. Gallager, and E. R. Berlekamp, "Lower bounds to
error probability for coding in discrete memoryless channels I", Inf. and
Control, vol. 10, pp. 65-103, 1967.

[28] V. R. Sidorenko, "An upper bound on the length of q-ary codes",
(Russian) Problemy Peredachi Informacii, vol. 11, no. 3, pp. 14-20, 1975.

[29] D. J. J. Versfeld, A. J. H. Vinck, and H. C. Ferreira, "Reed-Solomon
coding to enhance the reliability of M-FSK in a power line environment",
in Proc. Int. Symp. Power Line Commun. and its Applications,
Vancouver, BC, Canada, April 2005, pp. 100-104.

[30] D. J. J. Versfeld, A. J. H. Vinck, J. N. Ridley, and H. C. Ferreira,
"Constructing coset codes with optimal same-symbol weight for detecting
narrowband interference in M-FSK systems", IEEE Trans. Inf. Theory,
vol. 56, pp. 6347-6353, December 2010.

[31] A. J. H. Vinck, "Coded modulation for power line communications",
AEU Int. J. Electron. and Commun., vol. 54, pp. 45-49, 2000.



